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OLYPHOSAT E TOLERANT PLANTS 

This is a continuation-in-part of our co-pending 
application having serial number 07/543,236 which was filed on 
June 25, 1990. 

R A CTCfrROT TN T) OF THE INVENTION 

Recent advances in genetic engineering have 
provided the requisite tools to transform plants to contain 
foreign genes. It is now possible to produce plants which have 
unique characteristics of agronomic importance. Certainly, 
one such advantageous trait is more cost effective, 
environmentally compatible weed control via herbicide 
tolerance. Herbicide-tolerant plants may reduce the need for 
tillage to control weeds thereby effectively reducing soil erosion. 

One herbicide which is the subject of much 
investigation in this regard is N-phosphonomethyl-glycine 
commonly referred to as glyphosate. Glyphosate inhibits the 
2Q shikimic acid pathway which leads to the biosynthesis of 
aromatic compounds including amino acids and vitamins. 
Specifically, glyphosate inhibits the conversion of 
phosphoenol pyruvic acid and 3-phospho shikimic acid to 5- 
enolpynivyl-3-phosphoshikimic add by inhibiting the enzyme 5- 
enolpyruvyl-3-phosphoshikimic acid synthase (EPSP synthase 
orEPSPS). 

It has been shown that glyphosate tolerant plants 
can be produced by inserting into the genome of the plant the 
capacity to produce a higher level of EPSP synthase which 
enzyme is preferably glyphosate tolerant (Shah et al M 1986). 
The introduction into plants of glyphosate degradation gene(s) 
could provide a means of conferring glyphosate tolerance to 
plants and/or to augment the tolerance of transgenic plants 
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already expressing a glyphosate tolerant EPSP synthase 
depending upon the physiological effects of the degradation 
products. 

Glyphosate metabolism (degradation) has been 
examined in a wide variety of plants and little degradation has 
been reported in most of those studies. In those instances 
where degradation has been reported, the initial breakdown 
product is usually aminomethylphosphonate (AMPA) 
(Couplarid, 1985; Marshall et al. t 1987). In these instances, it is 
not clear if glyphosate is metabolized by the plant or the 
contaminating microbes on the ieaf surface to which glyphosate 
was applied. AMPA has been reported to be much less 
phyto toxic than glyphosate for most plant species (Franz, 1985) 
but not for all plant species (Maier, 1983; Tanaka et al. f 1988). 
Glyphosate degradation in soils is much more extensive and 
rapid (Torstensson, 1985). The principal breakdown product 
identified is AMPA (Rueppel et al., 1977; Nomura and Hilton, 
1977); a phosphonate that can be metabolized by a wide variety of 
microorganisms (Zeleznick et al., 1963; Mastalerz et al., 1965; 
20 Cook et al., 1978; Daughton et al;, 1979a; 1979b; 1979c; Wackett et 
al., 1987a). A number of pure cultures of bacteria have been 
identified that degrade glyphosate by one of the two known 
routes (Moore et al., 1983; Talbot et al., 1984; Shinabarger and 
Braymer, 1986; Balthazor and Hallas, 1986; Kishore and Jacob, 
25 1987; Wackett et al., 1987a; Pipke et al., 1987a; Pipke et al-, 
1987b; Hallas et al., 1988; Jacob et al., 1985 and 1988; Pipke and 
Amrhein, 1988; Quinn et al., 1988 and 1989; Lerbs et al., 1990; 
Schowanek and Verstraete, 1990; Weidhase et al., 1990; Liu et 
al., 1991). A route involving a s"C-P lyase" that degrades 
30 glyphosate to sarcosine and inorganic orthophosphate (Pi) has 
been reported for a Pseudomonas sp. (Shinabarger and 
Braymer, 1986; Kishore and Jacob, 1987) and an Arthrobacter 
sp. (Pipke et al., 1987b). Pure cultures capable of degrading 
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glyphosate to AMPA have been reported for a Flavobacterium .■■t 
sp. (Balthazor and Hallas, 1986), for a Pseudomonas sp. (Jacob ; ^ 

et al., 1988) and for Arthrobacter atrocyaneus (Pipke and ■a'jU 
Amrhein, 1988). In addition, a large number of isolates that / J) 

5 convert glyphosate to AMPA have been identified from 
industrial activated sludges that treat glyphosate wastes 
(Hallas et al. t 1988). However, the number and nature of 
bacterial genes responsible for these degradations have not been 
heretofore determined nor have the gene(s) been isolated. 

10 Hence, in one aspect, an object of the present 

invention is to provide novel genes which encode a glyphosate 
metabolizing enzyme which converts glyphosate to 
aminomethylphosphonate and glyoxylate. 

Another object is to enhance the activity of the 

15 glyphosate metabolizing enzyme against glyphosate by 
replacement of specific amino acid residues. 

Another object of the present invention is to provide 
genetically modified plants which express a gene which 
encodes a glyphosate metabolizing enzyme and which exhibit 

20 enhanced tolerance to glyphosate herbicide. 

Another object is to demonstrate that a glyphosate 
metabolizing enzyme can be targeted to plastids using 
chloroplast transit peptides and the plastid targeted enzyme 
confers high level glyphosate tolerance. 

25 A further object is to provide a method for selecting 

transformed plant tissue using the glyphosate metabolizing 
enzyme as the selectable marker in the presence of inhibitory 
concentrations of glyphosate. 

These and other objects, aspects and features of the 

30 present invention will become evident to those skilled in the art 
from the following description and working examples. 
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SUMMARY O F TFTR INVENTION 



The present invention provides structural DNA 
constructs which encode a glyphosate oxido-reductase enzyme 
and which are useful in producing glyphosate degradation 
capability in heterologous microorganisms (e.g. bacteria and 
plants) and in producing glyphosate tolerant plants. 

In accomplishing the foregoing, there is provided, 
in accordance with one aspect of the present invention, a 
method of producing genetically transformed plants which are 
tolerant toward glyphosate herbicide, comprising the steps of: 
(a) inserting into the genome of a . plant cell a 
recombinant, double-stranded DNA molecule 
comprising 

(i) a promoter which functions in plant cells to 
cause the production of an RNA sequence, 

(ii) a structural DNA sequence that causes the 
production of an RNA sequence which 
encodes a glyphosate oxidoreductase enzyme, 

(iii) a 3' non-translated DNA sequence which 
functions in plant cells to cause the addition 
of polyadenylated nucleotides to the 3* end of 
the RNA sequence; 

where the promoter is heterologous with respect to 
the coding sequence and adapted to cause sufficient 
expression of said enzyme in plant tissue, including 
meristematic tissue, to enhance the glyphosate 
resistance of a plant cell transformed with said gene; 

(b) obtaining a transformed plant cell; and 

(c) regenerating from the transformed plant cell a 
genetically transformed plant which has increased 
tolerance to glyphosate herbicide. 
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In accordance with another aspect of the present 
invention, there is provided a recombinant, double-stranded 
DNA molecule comprising in sequence: 

(a) a promoter which functions in plants to cause the 
5 production of an RNA sequence; 

(b) a structural DNA sequence that causes the 
production of an RNA sequence which encodes a 
glyphosate oxidoreductase enzyme; and 

(c) a 3' non-translated region which functions in plants 
10 to cause the addition of polyadenylated nucleotides to 

the 3' end of the RNA sequence. 
There has also been provided, in accordance with another 
aspect of the present invention, bacterial and transformed plant 
cells that contain, respectively, DNA comprised of the above- 
15 mentioned elements (a), (b) and (c). 

In accordance with yet another aspect of the present 
invention, differentiated plants have been provided that 
comprise transformed plant cells, as described above, which 
exhibit tolerance toward glyphosate herbicide. 
20 In accordance with still another aspect of the present 

invention, there has been provided a method for selectively 
controlling weeds in a field containing a crop having planted 
crop seeds or plants comprising the steps of: 

(a) planting said crop seeds or plants which are 
25 glyphosate tolerant as a result of a recombinant 

double-stranded DNA molecule being inserted into 
said crop seed or plant, said DNA molecule having 

(i) a promoter sequence which functions in 
plants to cause the production of an RNA 

30 sequence, 

(ii) a structural DNA sequence which causes the 
production of RNA which encodes a 
glyphosate oxidoreductase enzyme, 
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(iii) a 3' non-translated region which encodes a 
polyadenylation signal which functions in 
plants to cause the addition of polyadenylated 
nucleotides to the 3* end of the RNA sequence, 

where the promoter is heterologous with respect to 
the coding sequence and adapted to cause sufficient 
expression of said enzyme in plant tissue, including 
meristematic tissue, to enhance the glyphosate 
10 tolerance of a plant cell transformed with said gene; 

and 

(b) applying to said crop and weeds in said field a 
sufficient amount of glyphosate herbicide to control 
said weeds without significantly affecting said crop. 
15 In a particularly preferred embodiment the double- 

stranded DNA molecule comprising a gene for plant expression 
comprises a structural DNA sequence encoding a fusion 
polypeptide containing an amino- terminal chloroplast transit 
peptide which is capable of causing importation of the carboxy- 
Z) terminal glyphosate oxidoreductase enzyme into the chloroplast 
of the plant cell expressing said gene. 

A further embodiment of the present invention is the 
use of the glyphosate oxidoreductase gene as a selectable 
marker to select and identify transformed plant tissue. 

25 

TlttTKF DESmiTPTTON QV THK DRAWINGS 

Figure 1 shows the DNA sequence for the full- 
length promoter of figwort mosaic virus (FMV). 
3Q Figure 2 shows the structural DNA sequence for a 

glyphosate oxidoreductase gene from bacterial isolate LBAA. 

Figure 3 shows a comparison of the manipulated 
structural glyphosate oxidoreductase gene versus a modified 
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glyphosate oxidoreductase gene adapted for enhanced 
expression in plants. The manipulated glyphosate 
oxidoreductase gene is displayed as the upper DNA sequence. 
Only the changes made in the modified gene are indicated in 

* the lower strand of sequences. 

Figure 4 shows a comparison of the manipulated 
structural glyphosate oxidoreductase gene versus a synthetic 
glyphosate oxidoreductase gene adapted for enhanced 
expression in plants. The manipulated glyphosate 

10 oxidoreductase gene is displayed as the upper DNA sequence. 

Figure 5 shows the structure of pMON 17032, a 
pMON886 vector containing the modified glyphosate 
oxidoreductase gene inserted as an En-CaMV35S-modified 
glyphosate oxidoreductase-NOS 3' cassette into the Notl site of 

15 the vector. The pMON886 vector is described in the text. 

Figure 6 shows the nucleotide sequence of the 
CTF1 chloroplast transit peptide derived from the A. thaliana 
SSU1A gene. 

Figure 7 shows the genetic/structural map of 
20 plasmid pMON17066 f a pMON979-type vector containing a gene 
encoding a CTP/synthetic glyphosate oxidoreductase fusion 
polypeptide. Related pMON979-type derivatives are pMON17065 
andpMON17073. 

Figure 8 shows the genetic/structural map of 
25 plasmid pMON17138, an example of a pMON981-type vector 
containing a gene encoding a CTP/synthetic glyphosate 
oxidoreductase fusion polypeptide. In this example the CTP1- 
synthetic glyphosate oxidoreductase gene has been cloned into 
pMON979 as a Xbal-BamUI fragment. 
30 Figur- 9 shows the nucleotide sequence of the 

CTP2 chloroplast transit peptide derived from the A. thaliana 
EPSPS gene. 
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Figure 10 shows the structural map of plasmid 
pMON17159. 

Figure 11 shows the structural map of plasmid 

pMON17226. 

> Figure 12 shows the structural map of plasmid 

pMON17164. 

STATEMENT OF THE INVENTION 

The expression of a plant gene which exists in 
double-stranded DNA form involves synthesis of messenger 
RNA (mRNA) from one strand of the DNA by RNA polymerase 
enzyme, and the subsequent processing of the mRNA primary 
transcript inside the nucleus. This processing involves a 

15 3* signal region which facilitates addition of polyadenylate 
nucleotides to the 3' end of the RNA. 

Transcription of DNA into mRNA is regulated by a 
region of DNA usually referred to as the "promoter." The 
promoter region contains a sequence of bases that signals RNA 

20 polymerase to associate with the DNA, and to initiate the 
transcription into mRNA using one of the DNA strands as a 
template to make a corresponding complementary strand of 
RNA. 

A number of promoters which are active in plant 
25 cells have been described in the literature. These include the 
nopaline synthase (NOS) and octopine synthase (OCS) 
promoters (which are carried on tumor-inducing plasmids of 
Agrobacterium tumefaciens), the caulimovirus promoters such 
as the cauliflower mosaic virus (CaMV) 19S and 35S promoters 
30 and the figwort mosaic virus (FMV) 35S promoter, the light- 
inducible promoter from the small subunit of ribulose bis- 
phosphate carboxylase (ssRUBISCO, a very abundant plant 
polypeptide). All of these promoters have been used to create 
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various types of DNA constructs which have been expressed in 
plants; see, e.g., PCT publication WO 84/02913 (Rogers et el., 
Monsanto). 

Promoters which are known or are found to cause 

5 transcription of DNA in plant cells can be used in the present 
invention. Such promoters may be obtained from a variety of 
sources such as plants and plant DNA viruses and include, but 
are not limited to, the CaMV35S and FMV35S promoters and 
promoters isolated from plant genes such as ssRUBISCO genes 

10 or the chlorophyll a/b binding proteins. As described below, it is 
preferred that the particular promoter selected should be 
capable of causing sufficient expression to result in the 
production of an effective amount of glyphosate oxidoreductase 
to render the plant substantially tolerant to glyphosate 

15 herbicides. The amount of glyphosate oxidoreductase needed to 
induce the desired tolerance may vary with the plant species. 

It is preferred that the promoters utilized have 
relatively high expression in all meristematic tissues in 
addition to other tissues inasmuch as it is now known that 

20 glyphosate is translocated and accumulated in this type of plant 
tissue. Alternatively, a combination of chimeric genes can be 
used to cumulatively result in the necessary overall expression 
level of glyphosate oxidoreductase enzyme to result in the 
glyphosate tolerant phenotype. 

25 The mRNA produced by a DNA construct of the 

present invention also contains a 5* non-translated leader 
sequence. This sequence can be derived from the promoter 
selected to express the gene, and can be specifically modified so 
as to increase translation of the mRNA. The 5* non-translated 

30 regions can also be obtained from viral RNAs, from suitable 
eukaryotic genes, or from a synthetic gene sequence. The 
present invention is not limited to constructs, as presented in 
the following examples* wherein the non-translated region is 
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derived from both the 5* non-translated sequence that 
accompanies the promoter sequence and part of the 5* non- 
translated region of the virus coat protein gene. Rather, the 
non-translated leader sequence can be derived from an 
5 unrelated promoter or coding sequence as discussed above. 

A preferred promoter for use in the present 
invention is the full-length transcript OSS) promoter from the 
figwort mosaic virus (FMV) which functions as a strong and 
uniform promoter for chimeric genes inserted into plants, 
10 particularly dicotyledons. In general* the resulting transgenic 
plants express the protein encoded by the inserted gene at a 
higher and more uniform level throughout the tissues and cells 
than the same gene driven by an enhanced CaMV35S promoter. 
Referring to Figure 1, the DNA sequence of the promoter is 
15 located between nucleotides 6368 and 6930 (SEQ ID NO:l) of the 
FMV genome. A 5' non- translated leader sequence is 
preferably coupled with the promoter and an exemplary leader 
sequence (SEQ ID NO:2) is shown in Figure 1. The leader 
sequence can be from the FMV genome itself or can be from a 
2) source other than FMV. 

The 3* non-translated region of the chimeric plant 
gene contains a polyadenylation signal which functions in 
plants to cause the addition of polyadenylate nucleotides to the 
3* end of the RNA. Examples of suitable 3* regions are (1) the 
^ 3* transcribed* non-translated regions containing the 
polyadenylation signal of Agrobacterium tumor-inducing (Ti) 
plasmid genes* such as the nopaline synthase (NOS) gene, and 
(2) plant genes such as the soybean storage protein genes and 
the small subunit of the ribulose-l,5-bisphosphate carboxylase 
30 (ssRUBISCO) gene. An example of a preferred 3* region is that 
from the ssRUBISCO gene from pea (E9), described in greater 
detail in the examples below. 
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The DNA constructs of the present invention also 
contain a structural coding sequence in double-Btranded DNA 
form, which encodes a glyphosate oxidoreductase enzyme 
which converts glyphosate to aminomethylphosphonate and 
5 glyoxylate. 

ftnmmflrv nf the Oly phoflate Oxidoreductase Reaction 

The enzyme glyphosate oxidoreductase catalyzes 
the cleavage of the C-N bond of glyphosate yielding 
10 aminomethyl phosphonate (AMPA) and glyoxylate as the 
reaction products. Under aerobib conditions, oxygen is utilized 
as a cosubstrate for the reaction. Other electron carriers such 
as phenazine methosulfate and ubiquinone stimulate the 
reaction under aerobic conditions. In the absence of oxygen, 
15 these compounds act as electron acceptors. 

The enzymatic reaction can be assayed by oxygen 
uptake using an oxygen electrode. The glyphosate oxido- 
reductase from LBAA does not produce hydrogen peroxide as a 
product of oxygen reduction. This enzyme has a stoichiometry 
20 of two moles of glyphosate oxidized per mole of oxygen 
consumed and produces two moles each of AMPA and 
glyoxylate as reaction products. 

An alternate method for the assay of glyphosate 
oxidoreductase involves reaction of the sample with 2,4- 
25 dinitxophenylhydxazine and determination of the amount of the 
glyoxylate-2,4-dinitrophenylhydrazone by HPLC analysis as 
described in detail in a later section. 

A third method for the assay of glyphosate 
oxidoreductase consists of using [3-nC]-glyphosate aB a 
X substrate; the radioactive AMPA produced by the enzyme is 
separated from the substrate by HPLC on anion exchange 
column as described later. The radioactivity associated witn 
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AMPA is a measure of the extent of the glyphosate 
oxidoreductase reaction. 

Glyphosate oxidoreductase from LBAA is a 
flavoprotein using FAD as a cofactor. One of the mechanisms 

5 we have proposed for the reaction catalyzed by this enzyme 
involves the reduction of the FAD at the active site of the enzyme 
by glyphosate. This leads to the formation of reduced FAD and 
a Schiffbase of aminomethylphosphonate with glyoxylate. The 
Schiff base iB hydrated by water and hydrolyzed to its 

10 components, AMPA and glyoxylate. The reduced flavin is 
reoxidized by molecular oxygen! We suggest that during the 
process of reoxidation of reduced FAD, an oxygenated flavin is 
produced as an intermediate. This flavin intermediate may 
catalyze the oxygenation of glyphosate yielding AMPA and 

15 glyoxylate. This hypothesis is in accordance with the observed 
stoichiometry and our inability to detect hydrogen peroxide in 
the reaction mixture. 

In addition to glyphosate, glyphosate oxido- 
reductase from LBAA oxidizes iminodiacetic acid (IDA) to 

20 glycine and glyoxylate. The rate of the reaction with EDA is 
significantly faster than with glyphosate. 

TRnjAtinn nf Efficient Glvphosate'-to-AMPA Degrading 

25 Bacteria capable of degrading glyphosate are 

known. (Hallas et al., 1988; Malik et al., 1989). A number of 
these bacteria were screened for the rapid degradation of 
glyphosate in the following manner: twenty three bacterial 
isolates were transferred from TSA (Trypticase Soya Agar; 

30 BBL) plates into medium A consisting of Dworkin-Foster salts 
medium containing glucose, gluconate and citrate (each at 
0.1%) as carbon source and containing glyphosate at 0.1 mM as 
the phosphorous source. 
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Dworkin-Foster minimal medium was made up by 
combining in 1 liter (with autoclaved H2O) 1 ml each of A, B and 
C and 10 ml of D, thiamine HC1 (5 mg), C-sources to final 
concentrations of 0.1% each and P-source (glyphosate or other 
5 phosphonates or Pi) to the required concentration: 

A. D-F Salts (1000X stock; per 100 ml; autoclaved): 

H3BO3 lmg 
MnS04.7H20 lmg 
!0 ZnS04.7H20 12.5 mg 

CuS04.5H20 8 mg 
NaMo03.3H20 1.7 mg 

B. FeSO4.7H20 (1000X stock; per 100 ml; autoclaved) 

0.1 g 

15 C. MgS04.7H20 (1000X stock; per 100 ml; autoclaved) 

20 g 

D. (NH4)2S04 (100X stock; per 100 ml; autoclaved) 

20 g 

20 Yeast Extract (YE; Difco) was added to a final concentration of 

0.01 or 0.001%. 

Each 1 ml of culture medium also contained 

approximately 200,000 cpm [3-i4C]glyphosate (Amersham; 

CFA.745). The cultures were incubated with s h a kin g at 30°C. 
25 Isolate LBAA showed significant growth at day one, while 

other test cultures showed little growth before day three. 

Determination of radioactivity (by scintillation counting) in the 

culture, cell pellet and culture supernatant (at day 4) revealed 

that total 14C radioactivity had decreased and that remaining 
30 was partitioned -1:1 in the supernatant and pellet, indicating 

that significant uptake and metabolism of glyphosate had taken 

place. 
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TABLE I ■ Glyphosate Metabolism bv LBAA Culture 

Sample U C ram 

5 control 18,631 

LBAA culture 11,327 

LBAA supernatant 6,007 

LBAA cells 4,932 

10 At day five, 75 ul of the culture supernatant of all test 

cultures was analyzed by HPLC as follows: a SYNCHRO PAK™ 
AX100 anion exchange column (P.J. Cobert) was used and the 
mobile phase consisted of 65 mM KH2PO4 (pH5.5 with NaOH; 
depending on the needs of the experiment the concentration of 

15 the phosphate buffer was varied from 50 to 75 mM in order to 
alter the retention times of the material), run isocratically and 
the eluted material monitored continuously using a radioactive 
detector. This analysis revealed, in one isolate in particular 
(LBAA), that the glyphosate peak (Retention Time [RT] » 7.0 

20 minutes in this analysis) was completely absent and a new 
peak of radioactivity had appeared, with the same RT as 
methylamine or N-acetylmethylamine (RT = 3.5 minutes). The 
collection of bacteria, of which strain LBAA formed a part, had 
been characterized as degrading glyphosate to AMP A (Hall as et 

25 al., 1988); the detection of methylamine or N-Acetylmethyl- 
amine suggested that the AMPA or N-AcetylAMPA was being 
metabolized by the LBAA M C-P lyase" activity to release the 
phosphate required for growth in this experiment. Strain 
LBAA was examined in greater detail. 



30 



Conversion of Ghmhosate to A MPA in Microbial Isolates 

For clarity and brevity of disclosure, the following 
description of the isolation of genes encoding glyphosate 
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oxidoreductase enzymes is directed to the isolation of such a 
gene from a bacterial isolate (LBAA). Those skilled in the art 
will recognize that the same or a similar strategy can be 
utilized to isolate such genes from other microbial isolates. 
5 The glyphosate degradation pathway was 

characterized in resting cells of glyphosate-grown strain LBAA 
as follows: the cells from a 100 ml culture of LBAA, grown in 
DF medium with glucose, gluconate and citrate as carbon 
sources and with thiamine and Yeast Extract (0.01%) to supply 
10 trace requirements (= medium DF3S) and with glyphosate at 
0.2 mM as a phosphorous source, were harvested at Klett = 200, 
washed twice with 20 ml of DF3S medium and the equivalent of 
20 ml cells resuspended in 100 ul of the same medium 
containing [3-«C]glyphosate (2.5 ul of 52 mCi/mmol). The cell 
15 mix was incubated at 30°C with shaking and samples (20 ul) 
were withdrawn at intervals. The samples were centrifuged 
and both the supernatant and cell pellets were analyzed by 
HPLC (the cell pellets were resuspended in 100 ul of acid-DF3S 
[= DF3S, 0.65N HC1], boiled for 5 minutes, centrifuged briefly 
20 and this supernatant was analyzed; an acidified glyphosate 
control was also examined). In two hours the amount of 
radioactivity in the glyphosate peak (RT = 7.8 minutes) in the 
supernatant had decreased to -33% of the starting level; about 
3% of the glyphosate was found within the cell. Material 
25 co-eluting with the methyl amine standard accounted for -5% of 
the starting counts in the supernatant and for -1.5% in the cell 
pellet. A new peak, accounting for ~1.59fc of the starting 
radioactivity with a RT of 7.7 minutes (glyphosate RT = 8.9 
minutes upon acidification in this experiment) was identified 
30 in the cell contents. The large decrease in overall radioactivity 
also suggested that the glyphosate was extensively metabolized 
in this experiment. The pathway was elucidated further in a 
subsequent experiment where the metabolism of [i*C]AMPA 
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was compared to that of [3-i4C]glyphosate (as above) in resting 
cells harvested at Klett 165 and resuspended at the equivalent to 
15 ml cells per 100 ul DF3S medium. The samples were 
analyzed by HPLC and consisted of whole cultures acidified and 

5 treated as described above. Within the first two hours of the 
glyphosate experiment, 25% of the radioactivity was found in 
the methylamine/N-acetylmethalamine peak (RT = 4.8 
minutes), 12.5% as AMPA (RT a 6.4 minutes), 30% as the peak 
alluded to above (RT =* 9.4 minutes) and 30% as glyphosate (RT = 

10 11.8 minutes). In the AMPA experiment 15% of the 
radioactivity was found as N r acetylmethylamine/methyl- 
amine, 59% as AMPA and 18% in the peak with RT = 9.4 
minutes. The modified form of AMPA was identified as 
N-acetylAMPA. A similar acetylation step has been inferred 

15 from the products identified in E. coli growing in 
aminomethylphosphonates as sole sources of P (Avila et al., 
1987). These data indicated that the glyphosate degradation 
pathway in LBAA is glyphosate AMPA methylamine) 
N-acetylAMPA N-acetylmethylamine. 

20 

Hlrming of the Glvnhoaate Oxirioreductafle Gene(s^ in E. coli 

Having established the glyphosate-to-AMPA 
conversion in strain LBAA, a direct approach for the cloning of 
the gene(s) involved in this conversion into E. coli was 
25 investigated. Cloning send genetic techniques, unless otherwise 
indicated, were generally those described (Maniatis et aL, 1982). 
Th6 cloning strategy was as follows: introduction of a cosmid 
bank of strain LBAA into E. coli and selection for the 
glyphosate-to-AMPA gene(s) by requiring growth on glyphosate 
X as a phosphorous (P) source. This selection relied on the use of 
AMPA generated by the glyphosate metabolizing enzyme as a P 
source, following the release of the Pi from the AMPA by the 
E. coli "C-P lyase." Most E. coli strains are incapable of 
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utilizing phoBphonates as P sources upon initial challenge, 
however these strains usually adapt rapidly, independently of 
RecA, to utilize phosphonates (become Mpu+) (Wackett et al. f 
1987b). E. coli Mpu+ was isolated from I?, coli SR200 (Leu-, Pro-, 

5 recA, hsdR, supE, Sznr, tonA t ) as follows: an aliquot of a fresh 
Lrbroth culture of E. coli SR200 was plated on MOPS (Neidhardt 
et al., 1974) complete agar (i.e., contains L-leucine and L- 
proline at 25 ug/ml and vitamin Bl [thiamine] at 10 ug/ml; agar 
s DIFCO "Purified") containing aminomethylphosphonate 

10 (AMPA; 0.2 mM; Sigma) as P source. 

MOPS medium is: 
10 ml 10X MOPS SALTS 
2 ml 0.5 mg/ml Thiamine HC1 
15 1 ml 20% glucose 

10XMOPS Salts are: 
for 100 ml 



25 



40 ml 


lMMOPSpH7.4 


4ml 


1M Tridne pH7.4 


1ml 


0.01MFeSO4.7H2O 


5ml 


i.9 m mua. 


1ml 


0.276 MK2SO4 


1ml 


0.5 mM CaCl2 


1ml 


0.528 MMgCl 2 


10 ml 


5MNaCl 


1ml 


0.5% L-Methionine 



lml Micronutrients 
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Micro nutrients are: 

3 x 10-9 M (NH4) 6 Mn70 2 4 

4 x 10-7 M H3BO4 
3 x 10-a M C0CI2 

5 1.6 x 10-8 MCuSO* 

8 x 10-a M MnCl 2 
lxlO-aMZnSO-i 

10 Six individual colonies were picked from this plate after three 
days incubation at 37°C and streaked on MOPS complete agar 
containing either AMPA or methylphosphonate (Alfa) as P 
source. One colony, designated E. coli SR200 Mpu+, was chosen 
from those that grew equally and uniformly on both 

15 phosphonate media. 

Chromosomal DNA was prepared from strain LBAA 
as follows: The cell pellet from a 100 ml L-Broth (Miller, 1972) 
late log phase culture of LBAA was resuspended in 10 ml of 
Solution I (Birnboim and Doly, 1979). SDS was added to a final 

20 concentration of 1% and the suspension was subjected to three 
freeze-thaw cycles, each consisting of immersion in dry ice for 
15 minutes and in water at 70°C for 10 minutes. The lysate was 
then extracted four times with equal volumes of 
phenol:chloroform (1:1; phenol saturated with TE) (TE = lOmM 

25 Tris pH8.0; l.OmM EDTA) and the phases separated by 
centrifugation (15000g; 10 minutes). The ethanol-precipitable 
material was pelleted from the supernatant by brief 
centrifugation (8000g; 5 minutes) following addition of two 
volumes of ethanol. The pellet was resuspended in 5 ml TE 

30 and dialyzed for 16 hours at 4°C against 2 liters TE. This 
preparation yielded a 6 ml DNA solution of 150 |xg/ml. 

Partially-restricted DNA was prepared as follows: 
Three 100 \ig aliquot samples of LBAA DNA. were treated for 1 
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hour at 37°C with restriction endonuclease Hindlll at rates of 
4, 2 and 1 enzyme unit/ug DNA, respectively. The DNA 
samples were pooled, made 0.25 mM with EDTA and extracted 
with equal volume of phenol: chloroform. Following the 
addition of NaAcetate and ethanol, the DNA was precipitated 
with two volumes of ethanol and pelleted by centrifugation 
(12000 g\ 10 minutes). The dried DNA pellet was resuspended 
in 500 ul TE and layered on a 10-40% Sucrose gradient (in 5% 
increments of 5.5 ml each) in 0.5 M NaCl, 50 mM Tris pH8.0, 5 
mM EDTA. Following centrifugation for 20 hours at 26,000 rpm 
in a SW28 rotor, the tubes were punctured and 1 ml fractions 
collected. Fifteen ul samples of each third fraction were run on 
0.8 % agarose gel and the size of the DNA determined by 
comparison with linearized lambda DNA and ifindlll- digested 
lambda DNA standards. Fractions containing DNA of 25-35 kb 
fragments were pooled, desalted on AMICON10 columns (7000 
rpm; 20°C; 45 minutes) and concentrated by precipitation. This 
procedure yielded 50 ug of LBAA DNA of the required size. 

Plasmid pHC79 (Hohn and Collins, 1980) DNA and a 
HindHE-phosphataae treated vector was prepared as described 
elsewhere (Maniatis e.t al., 1982). The ligation conditions were 
as follows: 
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Vector DNA (Hindlll- and calf 

alkaline phosphatase-treated) 1.6 \ig 

Size fractionated LBAA 
5 Hindlll fragments 3.75 ug 

10X ligation buffer 2.2 ^il 

250 mM Tris-HCl, pH 8.0; 
100 mM MgCb; 
10 100 mM Dithiothreitol; 

2 mM Spermidine 

T4 DNA ligase 
(Boehringer-Mannheim) 
15 (400 units/ill) 1.0 pi 

H2O to 22.0 nl 
18 hours at 16°C. 

20 The ligated DNA (4 ul) was packaged into lambda phage 
particles (Stratagene; Gigapack Gold) using the 
manufacturer's procedure. 

E. coli SR200 Mpu*, grown overnight in L-Broth (with 
maltose at 0.2%), was infected with 50 ul of the packaged DNA. 
25 Transformants were selected on MOPS complete agar plus 
ampidllin and with glyphosate at 0.2 mM as P source. 

Aliquot samples were also plated on MOPS 
(Neidhardt et aL, 1974) complete agar plus ampicillin 
containing Pi at ImM to titer the packaged cosmids. Cosmid 
30 transformants were isolated on this latter medium at a rate of 
-105 per ^g/LBAA Hindlll DNA after 2 days at 37°C. Colonies 
arose on the glyphosate-^gar from day 3 until day 10 with a final 
rate of 1 per 200-300 cosmids. Plasmid DNA was prepared from 
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twenty one cosmid tran9formants from the glypnosate plates. 
These cosmids fell into at least two classes based on the Hmdlll 
restriction pattern of the plasmid DNA. In Class I, all the 
cosmids had cloned 6.4 and 4.2 kb H in dill restriction 
5 fragments in common and in Class II, a -23 kbp fragment in 
common. Ten cosmids, representative of the diversity of the 
cloned fragments, were re- transformed into E. coli SR200 Mpu+ 
and the glypnosate utilization trait verified by selection for 
growth on MOPS complete agar plus ampicillin plus glypnosate 
10 plates. The final cell density achieved by the cultures using 
glypnosate (0.2mM in MOPS medium) as a P source was also 
determined and little difference could be discerned between the 
different transforxnants. Transformants were also inoculated 
into MOPS complete broth with AMPA at 0.1 mM as P source 
15 (to ensure the presence of W C-P lyase" activity) and after 24 
hours at 37 °C were diluted 100-fold into MOPS complete 
medium with glypnosate at 0.1 mM and [3-i*C]glyphosate 
(40,000 cpm/ml). All the cosmid- containing cells degraded 
glypnosate and generated N-acetylAMPA and N-acetylmethyl- 
20 amine, with no great difference in the rate. The 
N-acetylAMPA was found in the culture supernatant in these 
tests. One cosmid from Class I, identified as pMON7468, was 
chosen for further study. A second glypnosate oxidoreductase 
gene h" been identified from a Class II cosmid done. 
25 Cell-free lysates E. coli SR200 Mpu+/pMON7468 were 

prepared from cells grown on MOPS complete medium with 
glypnosate at 1.0 mM (and supplemented with 
L-phenylalanine, L-tyrosine and L-tryptophan each at 100 
ug/ml and with para-hydroxybenzoic acid, 2,3-dihydroxy- 
30 benzoic acid and para-aminobenzoic acid each at 5 ug/ml to 
minimize the effects of inhibition of the E. coli EPSP synthase). 
The cell pellet (approx. 0.5 g wet weight) was resuspended in 1 
ml of lysis buffer (40 mM MOPS, pH7.4; 4 mM Tricine, pH 7.4; 
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10% glycerol; 1 mM DTT) and passed twice through a French 
Press. The cell debris was removed by centrifugation at 15000 
rpm for 10 minutes. The supernatant was assayed, following 
addition of MgCb to 10 mM, for degradation of radiolabeled 

5 glyphosate. The glyphosate substrate was supplied as 
[3-i4C]glyphosate (final concentration = 17 uM). The products 
observed were predominantly AMPA and some 
N-acetylAMPA; the production of AMPA is indicative of the 
cloned enzymatic activity from strain LBAA but the 

10 N-acetylAMPA could be due to endogenous E. coli activities 
(Avila et al., 1987). The specific activity for AMPA formation 
under these conditions was 13.3 pmoles AMPA/minute.mg 
protein. 

15 nVtnT»rtPriynHon of th e Glyphosate- to- AMPA Gene 

The cloned region responsible for this glyphosate 
oxidoreductase enzymatic activity was then localized in the 
cosmid. Deletions of pMON7468 were isolated, primarily within 
the cloned region, by using restriction enzymes that cut 
20 infrequently within the insert, as follows: plasmid DNA 
samples of 0.5 - 2 ug were digested to completion with 
restriction endonucleases Notl, Sad, Bglll or BcmHI, 
extracted with phenohchloroform, ethanol precipitated, 
resuspended in TE buffer and ligated for 2-4 hours at room 
25 temperature (or for 18 hours at 16°C) in a final volume of 50 ul 
with ligation buffer and T4 DNA ligase. Transformants were 
selected in E. coli SR200 Mpu+ and these deletions were 
examined for loss or retention of the glyphosate utilization 
phenotype. These data, in conjunction with restriction 
30 mapping of the clones, were used to localize the active region to 
near the central portion of the insert in pMON7468 that 
included the two common Hindlll fragments (6.4 and 4.2 kb). 
The HindUl restriction fragments from this region were then 
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subcloned into pBlueScript (Stratagene) and their glyphosate 
phenotype determined in E. coli JM101 Mpu* (the Mpu+ 
derivative of JM101 was isolated as described for SR200 Mpu«0. 
Clones containing the 6.4 kb Hindlll fragment, in either 
orientation, resulted in glyphosate utili2ation. Following 
restriction mapping of this Hindlll fragment, a series of 
deletion clones were isolated from the two 6.4 kb Hindlll clones 
using enzymes that cut infrequently in the insert and also in 
the polylinker region. A number of restriction fragments 
internal to the Hindlll fragment were also subcloned. The 3.5 
kb Pstl and 2.5 kb Bglll fragments, in either orientation, were 
positive for glyphosate utilization. These data, combined with 
those from the deletions, were used to localize the active region 
to an approximately 1.8 kb Bglll-Xhol fragment. In addition, 
deletions isolated from the 6.4 kb JfmdIII fragment indicated a 
TninimMTw coding region size of around 0.7 kb, with the EcoBl 
and Sad sites probably located within the coding sequences. 

The direction of transcription/expression of the locus 
responsible for the glyphosate- to- AMPA enzymatic activity was 
determined as follows: E. coli JM101 Mpu> transformants of 
pMON7469 #1 and #4 (Clones of the 2.5 kb BgtU fragment in 
the BamHI site of pUCHS; opposite orientations) were grown in 
M9-glucose- thiamine- ampicillin broth, with and without the 
Plac inducer IPTG, harvested in late log phase (Klett 190-220), 
cell-free lysates of the four cultures were prepared as described 
above and were assayed' for glyphosate- to- AMP A activity with 
glyphosate at 17 *iM. The highest enzymatic activity was 
obtained for pMON7469 #1 plus IPTG, where the Xhol site is 
distal to the Plac, suggesting that the gene(s) were expressed in 
the BglU-to-Xhol direction. 
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TABLE II - Olvphnflflte to AMPA Activity in 
Cell-Free Lvsatea of E. coli Transformants 



Clone 



IPTG 

added 



Specific Activity 
pinoles AMPA /roin.mg 



10 



15 



20 



25 



pMON7469#l 
pMON7469#l 
pMON7469#4 
pMON7469#4 



no 
yes 
no 
yea 



<3.0 
32.0 
<3.0 
<3.0 



The only product observed was AMPA, suggesting that the 
AMPA acetylating activity that was described earlier had been 
induced in E. coli transformants growing on glyphosate as the 
P source. 

In a later experiment, cell lysates of pMON7469#l 
and pMON7470 (Bglll-Xhol 1.8 kb in pUC118; formed from 
pMON7469 #1 by deletion of the - 700 bpXhoI-Sall fragment) 
were assayed for glyphosate- to- AMP A activity with glyphosate 
at 2 mM (Sp. Act. [3-nC]glyphosate = 3.7 mCi/mmol; 0.2 
liCi/reaction; cultures grown with IPTG in medium) and much 
higher enzymatic activities were recorded, reflecting the 
improved assay conditions. 

TAHLF m - Glvnhoaate to AMPA Activity in 
Cell-Free Tyrants of K mil Trflnwfnrmnnta 



30 



Clone 

pMON7469#l 
pMON7470 



Specific Activity 
nmolea AMP A/mi n.mg 

15.04 

7.15 
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The proteins encoded by the Bglll fragment were 
determined in vivo using a T7 expression system (Tabor and 
Richardson, 1985) following cloning of this fragment into the 
BamRl site in the vector pBlueScript (+) (pMON7471 #1, #2; 
5 opposite orientations). Test and control plasmids were 
transformed into E. coli K38 containing pGPl-2 (Tabor and 
Richardson, 1985) and grown at 30°C in L-broth (2 ml) with 
ampicillin and kanamycin (100 and 50 Lig/ml, respectively) to a 
Klett reading of - 50. An aliquot was removed and the cells 
10 collected by centrifugation, washed with M9 salts (Miller, 1972) 
and resuspended in 1 ml M9 medium containing glucose at 
0.2%, thiamine at 20 ug/ml and containing the 18 amino acids 
at 0.01% (minus cysteine and methionine). Following 
incubation at 30°C for 90 minutes, the cultures were 
15 transferred to a 42°C water bath and held there for 15 minutes. 
Rifampicin (Sigma) was added to 200 ug/ml and the cultures 
held at 42°C for 10 additional minutes and then transferred to 
30°C for 20 minutes. Samples were pulsed with 10 jiCi of 
35S-methionine for 5 minutes at 30°C» the cells collected by 
20 centrifugation and suspended in 60-120 ul cracking buffer (60 
mM Tris-HCl 6.8/1% SDS/1% 2-mercaptoethanoV10% 
glycerol/0.01% bromophenol blue). Aliquot samples were 
electrophoresed on 12.5% SDS-PAGE and following soaking for 
60 minutes in 10 volumes of Acetic Acid-Methanol-water 
25 (10:30:60), the gel was soaked in ENLIGHTNING ™ (DUPONT) 
following manufacturer's directions, dried, and exposed at 
-70 =o C to X-Ray Film. Proteins labeled using 35S-methionine 
were detected only for the Bglll-to-Xhol direction, the largest 
about 45 kd in size. When the Bg lll-Xhol fragment was 
30 examined following cloning into the BamHI-X/ioI sites of 
pBlueScript (to form pMON7472), this -45 kd protein was still 
expressed. 
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The effect of expression of the glyp ho sate- to- AMP A 
activity on glyphosate tolerance of E. coli was determined 
initially by examining the growth of recombinants in media 
containing inhibitory concentrations of glyphosate. The test 

5 compared the growth of E. coli JM101 containing a control 
vector (pUC118; Viera and Messing, 1987) or the pUC118 clones 
of the 2.5 kb Bglll fragment (pMON7469 #1, #4). There was a 
very clear correlation between the glyphosate-utilization ability 
and glyphosate tolerance. This tolerance phenotype (resistance 

10 to 15 mM glyphosate) was then employed as a screen to quickly 
monitor for the phenotype of deletion clones such as pMON7470 
{Bglll-Xhol 1.8 kb in pUC118; formed from pMON7469 #1 by 
deletion of the - 700 hj> Xhol-Sall fragment) and later clones. 

!5 Mncleotide Sequence of the Structural Glvnhosate 
O-ridoreduH^fle Gene 

The nucleotide sequence of the Bglll-Xhol fragment 
(SEQ ID NO:3) was determined using single-stranded DNA 
templates (generated using the phage mid clones and the 
20 "helper" M13 phage R408) and the commercially available 
SEQUENASE ™ (International Biotechnologies, Inc.) kit. 
Computer analysis of the sequence (SEQ ID NO:3) revealed a 
single large open reading frame (ORF) in the BglU to Xhol 
direction and is presented in Figure 2 which includes the 
2> location of some of the relevant restriction Bites. The putative 
stop codon (UAA) was located 2 bp 5* of the Seal restriction cut 
site. Data to confirm that this UAA codon was the termination 
codon of the -45 kd ORF were derived as follows: previously the 
3* limits had been determined, based on the glyphosate 
30 utilization phenotype, to be between the Sad site (95 bp 
upstream of the Seal site) and the Xhol site. When the 
Bglll-Scal fragment was cloned into the BamHI-Smal sites of 
pBlueScript and the proteins expressed in vivo, the -45 kd 
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protein was still produced. The Bglll-Scal fragment was then 
recloned from this pBlueScript clone as Xbal-Hindlll into 
pUC118 Xbal-Hindlll and was found to confer resistance to 15 
mM glyphosate to E. coli JM101 transform ants. These data 
5 located the C-terminus of the -45 kd protein between the Sad 
and Seal sites. The only stop codon, in any reading frame, 
between these sites is that immediately upstream of the Seal 
site. 

There were two methionine codons (AUG; located at 
10 positions 120 and 186) that if used as the fMet would give rise to 
proteins of 46.140 and 44.002 kd, respectively, but neither was 
preceded by a clearly recognizable Shine-Dalgarno sequence. 

The start of the protein was delineated more precisely 
as follows: Bglll restriction site recognition sequences were 
15 introduced at positions upstream of the two potential start 
codons by site-directed mutagenesis of pMON7470, substituting 
AGATCT for the sequences AGACTG ("Bgl20 H ) and GTATGC 
CBgl86*'), 21 and 9 bp upstream of the AUG120 and AUGi86 t 
respectively. Except where noted, oligonucleotide primers for 
20 mutagenesis comprised the sequences to be altered flanked by 8- 
10 homologous bases on each aide. The glyphosate tolerance 
was determined for the mutated clones. Introduction of the 
Bglll site upstream of AUG120 had no effect on glyphosate 
tolerance while it was abolished by the mutagenesis that 
25 introduced the Bglll site upstream of AUGm. The effects of 
these mutageneses on the - 45 kd protein were examined by 
subcloning the mutated sequences into T7 expression vectors 
using a site in the polylinker of pMON7470 (Kpnl\ just 
upstream of the original Bglll site, and the downstream 
30 ifindlll site. This complete fragment was recloned into 
pl8UT3T7 (PHAKMACIA) Kpnl-HindlU and tested in Y1YQ as 
described above. The - 45 kd protein was still expressed and a* 
comparable levels from both of the "Bglll" mutagenized 
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10 



15 



20 



25 



30 



sequences. When the new Bglll sites were used as 5* ends (and 
the downstream tfmdIII site) for cloning into the pBlueScript 
BamHI-Hindlll sites, the -45 kd protein was still expressed 
when the new Bglll site upstream of AUG120 served as 5' end, 
but not when that located upstream of AUG186 was the 5* end. 
These data suggest strongly that the AUG 120 (or some codon 
located very close to it) is the N-terxninus of the glyphosate oxido- 
reductase protein. The Bglll site introduced upstream of the 
AUG 186 did not result in a prematurely terminated or highly 
unstable protein and suggests that the predicted coding 
sequence changes resulting from this mutagenesis 
(Valis-Cysig --> Argia-Alaxg) had severe effects on the activity of 

the enzyme. 

Further data to confirm the location of the 
N-terminus were obtained by introducing separately (by 
mutageneses of pMON7470), an Ncol restriction site 
recognition sequence (CCATGG for CTATGT; changes the 
second codon from Serine to Alanine) or an Ndel sequence 
(CATATG for CCTATG) at AUG120 and expressing this ORF 
using efficient E* coli expression vectors. The expression of the 
Ndel version is outlined here: the Ndel-Hindlll fragment, 
beginning at the putative AUG, was cloned into pMON2123 
{Ndel-Hin&IlD replacing the ompF-IGF-1 fusion fragment 
(Wong et aL, 1988). The resultant clone was introduced into 
E. coli JM101 and the cells induced with nalidixic acid as 
described (Wong et al„ 19.88) for 2 hours. The resultant protein 
was indistinguishable in size from the -45 kd protein on SDS 
PAGE and a cell lysate from an induced culture had a 
glyphosate oxidoreductase specific activity of 12.8 nmoles 
AMPA/min.mg. When compared in a separate experiment, no 
differences were observed for the glyphosate oxidoreductase 
activity when the second codon was Alanine instead of Serine. 
The structural DNA sequence for the glyphosate oxidoreductase 



Q?nd 1559 



^1992 DERWENT PUBLICATIONS LTD 



WO 92/00377 



PCT/US91/04514 



-29- 



enzyme (SEQ ID NO:4) begins at nucleotide 120 and ends at 
nucleotide 1415 of the S^/II-XhoI fragment of Figure 2 and the 
glyphosate oxidoreductese enzyme consists of 431 amino acids 
(SEQ ID NO:5). 



10 



15 



20 



25 



30 



Construction of Glvphnflflte OT idorediictage Plant Gene 

Tranafonnatinn Vectors 

To facilitate the manipulation of the structural 
glyphosate ozidoreductase gene, the internal EcoRl and Ncol 
restriction site recognition sequences were removed by site- 
directed mutagenesis to substitute the sequence GAATTT for 
GAATTC and CCACGG for CCATGG, respectively. A 
glyphosate ozidoreductase coding sequence suitable for 
introduction into and expression in plant transformation 
vectors was assembled in the following way: the Ncol ("Met-Ala- 
") N-terminua was combined with the Ncol- and 2?coRI-deleted 
coding sequences, and the C- terminus deleted to the Seal site, 
in a number of cloning steps using the internal SphI and 
EcoRV restriction sites. In these steps a BglH site was located 
immediately upstream of the Ncol site and EcoBl and ifindlll 
sites were located immediately downstream from the stop 
codon. The sequence of this manipulated glyphosate 
ozidoreductase gene (SEQ ID NO:6) is shown in Figure 3. The 
manipulated glyphosate ozidoreductase gene still codes for the 
wild-type glyphosate ozidoreductase protein. The 
manipulations do not alter the amino acid sequence of the 
glyphosate ozidoreductase. This glyphosate ozidoreductase 
structural sequence (SEQ ID NO:6), as a BglTUNcol- 
EcoRVHin&Hl fragment of 1321 bp, is readily cloned into an 
appropriate plant expression cassette. This glyphosate 
ozidoreductase gene (SEQ ID NO:6) was cloned as a Bglll- 
EcoRl fragment into the plant transformation and expression 
vector pMON979 to form pMON17073. 
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Modifieation and Resvnthesis of th e Olvphosate Oxidoreductase 
Pene Sentience 

The glyphosate oxidoreductase gene from LBAA 
contains sequences that could be inimical to high expression of 

5 the gene in plants. These sequences include potential 
polyadenylation sites that are often A+T-rich, a higher G+C% 
than that frequently found in plant genes (56% versus -50%), 
concentrated stretches of & and C residues, and codons that are 
not used frequently in plant genes. The high G+C% in the 

10 glyphosate oxidoreductase gene has a number of potential 
consequences including the following: a higher usage of G or C 
than that found in plant genes in the third position in codons, 
and the potential to form strong hair-pin structures that may 
affect expression or stability of the RNA. The reduction in the 

15 G+C content of the glyphosate oxidoreductase gene, the 
disruption of stretches of G's and C's, the elimination of 
potential polyadenylation sequences, and improvements in the 
codon usage to that used more frequently in plant genes, could 
result in higher expression of glyphosate oxidoreductase in 

20 plants. 

In the first phase of this experiment, selected regions 
of the gene were modified by Bite-directed mutagenesis. These 
modifications were directed primarily (but not exclusively) at 
reducing the G+C% and at breaking up some of the G+C 

25 clusters. The manipulated glyphosate oxidoreductase gene was 
firBt recloned into the phagemid vector pMON7258 as a Ncol- 
HindKL fragment to form pMON17014. Single stranded DNA 
was prepared from a dut ung E. coli strain. Seven regions of 
the gene were modified by site-directed mutagenesis using the 

30 primers listed in Table IV and the Bio Rad mutagenesis kit 
(Catalog #170-3576) and following the protocols provided with 
this kit. 
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For the sake of clarity, the reverse complement of the 
actual primers is presented. The base positions, in the 
sequences presented in Figure 2 and in Figure 3, corresponding 
to the primers are indicated by the first and second set of 
5 numbers, respectively. 

TATVTF, TV - Primps tn Modify thq Glvnhoaate Oxidoreductaae 
ffrnA finding Sequence 

10 ESIMES-1 (149-210; 38-99) 

CGCTGGAGCT GGAATCGTTG GTGTATGCAC TGCTTTGATG CTTCAACGTC 
GTGGATTCAA AG (SEQ ID NO:27) 



15 



"PRIMER 2 (623-687: 512-576) 

GCAGATCCTC TCTGCTGATG CTTTGCGTGA TTTCGATCCT AACTTGTCGC 
ATGCTTTTAC CAAGG (SEQ ID NOl28) 



20 



EBIMER3 (792-832; 681-721) 

GTCATCGGTT TTGAGACTGA AGGTCGTGCT CTCAAAGGCA T (SEQ ID NO:29) 

PRTMRR 4 (833-901: 722-790) 

TACAACCACT AACGGTGTTC TGGCTGTTGA TGCAGCTGTT GTTGCAGCTG 
GTGCACACTC TAAATCACI (SEQ ID NO:30) 



25 



R 5 (1031-1091: 920-580) 

GGAAATGGGT CTTCGTGTTG CTGGTACTGT TGAGTTTGCT GGTCTCACAG 

ctgctcctaa c (SEQ ID NO:31) 



30 



i£ (1179-1246; 1068-1135) 

TGGATGGGTT TTCGTCCTAG CATTCCTGAT TCTCTTCCAG TGATTGGTCG 
TGCAACTCGT ACACCCGA (SEQ ID NO:32) 
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PRIMER 7 (1247-1315: 1136-1204) 

CGTAATCTAT GCTTTTGGTC ACGGTCATCT CGGTATGACA GGTGCTCCAA 
TGACTGCAAC TCTCGTCTC (SEQ ID NO:33) 

The resultant gene (SEQ ID NO:7) was confirmed by 
sequencing and by the ability to provide comparable glyphosate 
tolerance levels as the manipulated glyphosate oxidoreductase 
gene control. This modified gene (SEQ ID NO:7) is referred to 
as "modified glyphosate oxidoreductase." The G+C% of the 
glyphosate oxidoreductase gene (SEQ ID NO:6) was reduced 
from -56% in the manipulated version to -52% in the modified 
version (SEQ ID NO:7). A comparison of the manipulated and 
modified glyphosate oxidoreductase gene is shown in Figure 3, 
with the manipulated version on top and the changes 
introduced to make the modified version on the bottom. This 
modified glyphosate oxidoreductase gene was cloned as a Bglll- 
EcoKI fragment into a plant expression cassette comprising the 
En-CaMV35S promoter and the NOS 3* sequences. This 
cassette was then cloned as a NotI fragment into the pMON886 
2Q vector to form pMON17032 (Figure 5). 

A synthetic glyphosate oxidoreductase gene (SEQ ID 
NO:8) was designed to change as completely as possible those 
jf^Tni^ftl sequences discussed above. In summary, the gene 
sequence was redesigned to eliminate as much as possible the 
following sequences or sequence features (while avoiding the 
introduction of unnecessary restriction sites): stretches of G's 
and C's of 5 or greater; A+T rich regions (predominantly) that 
could function as polyadenylation sites or potential RNA 
destabilization region, and codons not frequently found in plant 
genes. A comparison of the manipulated (SEQ ED NO:6) and 
synthetic (SEQ ED NO:8) glyphosate oxidoreductase genes is 
presented in Figure 4, with the manipulated gene (SEQ ID 
NO:6) on top and the differences introduced into the synthetic 
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gene (SEQ ID NO:8) on the bottom. The G+C% for the synthetic 
glyphosate oxidoreductase gene is -51% and the potential to 
form short, high energy, hair-pin structures is reduced. This 
synthetic gene was cloned as a Bglll-EcoRI fragment into 
pMON979 to form pMON17065 for introduction into plants. 

KTprAgflinn of nhlnrcmlag t Directed Olvphosate Oxidoreductase 

The glyphosate target in plants, the 5-eno/pyruvyl- 
shikimate-3-phosphate synthase (EPSPS) enzyme, is located in 
the chloroplast. Although glyphosate oxidoreductase activity 
located in the cytoplasm reduces/prevents glyphosate from 
reaching the chloroplast in the transgenic plant, directing the 
glyphosate oxidoreductase enzyme to the chloroplast has been 
found to further minimize the effects of glyphosate on EPSP 
15 synthase. Many chloroplast-localized proteins are expressed 
from nuclear genes as precursors and are targeted to the 
chloroplast by a chloropiast transit peptide (CTP) that is 
removed during the import steps. Examples of such 
chloroplast proteins include the small subunit (SSU) of Ribulose- 
20 1,5-bisphosphate carboxylase (RUBISCO), 5-€no/-pyruvyl- 
shikimate-3-phosphate synthase (EPSPS), Ferredoxin, 
Ferredoxin oxidoreductase, the Light-harvesting-complex 
protein I and protein II, and Thioredoxin F. It has been 
demonstrated in vivo and in vitro that non-chloroplast proteins 
25 may be targeted to the chloroplast by use of protein fusions with 
a CTP and that a CTP sequence is sufficient to target a protein 
to the chloroplast (della-Cioppa et al., 1987). 

The glyphosate oxidoreductase protein was targeted 
to the chloroplast by construction of a fusion between the 
C-terminus of a CTP and the N-terminus of glyphosate 
oxidoreductase. In the first example, a specialized CTP, 
derived from the SSU 1A gene from Arabidopsis thaliana 
(Timko et al., 1988) was? used. This CTP (designated CTP1) was 
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constructed by a combination of site-directed mutageneses. The 
CYP1 structure (SEQ ID NO:9) (Figure 6) is made up of the SSU 
1A CTP (amino acids 1-55), the first 23 amino acids of the 
mature SSU 1A protein (amino acids 56-78), a serine residue 
(amino acid 79), a new segment that repeats amino acids 50 to 
56 from the SSU 1A CTP and the first two amino acids from the 
mature protein (amino acids 80-87), and an alanine and 
methionine residue (amino acids 88 and 89). An iVcol 
restriction site is located at the 3' end (spans the Met codon) to 
facilitate the construction of precise fusions to the 5* of 
glyphosate oxidoreductase or other genes. At a later stage, a 
Bglll site was introduced upstream of the N terminus of the 
SSU 1A sequences to facilitate the introduction of the fusions 
into plant transformation vectors. A fusion was assembled 
between the CTP1 (SEQ ID NO:9) and the manipulated 
glyphosate oxidoreductase (SEQ ID NO:6) (through the Ncol 
site) in the pGEM3zft+) vector to form pMON17034. This vector 
may be transcribed in vitro using the SP6 polymerase and the 
RNA translated with 3&S-Methionine to provide material that 
may be evaluated for import into chloroplasts isolated from 
Lactitca sativa using the methods described hereinafter (della- 
Cioppa et aL t 1986, 1987).. This CTPl-glyphosate oxidoreductase 
fusion was indeed found to be imported into chloroplasts at 
about 9% efficiency of that of the control, 35S labeled PreEPSPS 
(pMON6140; della-Cioppa et al., 1986). A CTPl-glyphosate 
oxidoreductase fusion was then assembled with the synthetic 
glyphosate oxidoreductase gene (SEQ ID NO:8) and this was 
introduced as a Bglll-EcoRI fragment into plant vector 
pMON979 to form pMON17066 (Figure 7). Following an 
intermediate cloning step to acquire more cloning sites, this 
CTPl-glyphosate oxidoreductase fusion was also cloned as a 



Xbal-BamHI site into pMON981 to form pMON17138 (Figure 8). 
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In the second example, a CTP-glyphosate 
oxidoreductase fusion was constructed between the Arabidopsis 
thaliana EPSPS (Klee et al., 1987) CTP and the synthetic 
glyphosate oxidoreductase coding sequences. The Arabidopsis 
CTP was first engineered by site-directed mutagenesis to place 
a Sphl restriction site at the CTP processing site. This 
mutagenesis replaced the Glu-Lys at this location with Cys- 
Met. The sequence of this CTP, designated CTP2, (SEQ ID 
NO:10) is shown in Figure 9. The Ncol site of the synthetic 
glyphosate oxidoreductase gene (SEQ ID NO:8) was replaced 
with a Sphl site that spans the Met codon. The second codon 
was converted to one for leucine in this step also. This change 
had no apparent effect on the in vivo activity of glyphosate 
oxidoreductase in E. coli. The CTP2-synthetic glyphosate 
oxidoreductase fusion was cloned into pBlueScript KS(+) and 
this template was transcribed in vitro using T7 polymerase and 
the 35S-methionine-labeled material was shown to import into 
chloroplasts with an efficiency comparable to that for the CTP1- 
glyphosate oxidoreductase fusion. This CTP2- synthetic 
glyphosate oxidoreductase fusion was then cloned as a Xbal- 
BamUl fragment into a plant expression vector to form 
pMON17164. A structural map of this plasmid is presented in 
Figure 12. 

The plant vector portion of pMON17164 (Figure 12) is 
composed of the following segments. A chimeric kanamycin 
resistance gene engineered for plant expression to allow 
selection of the transformed tissue. The chimeric gene consists 
of the 0.35 Kb cauliflower mosaic virus 35S promoter (P-35S) 
(Odell et al., 1985), the 0.83 Kb neomycin phosphotransferase 
typell gene (KAN), and the 0.26Kb 3'-non-translated region of 
the nopalinee synthas gene (NOS 3') (Fraley et al., 1983). A 
0.45 Kb Clal to Dral fragment from the pTil5955 octopine Ti 
plasmid, which contains the T-DNA left border region (Barker 
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et ah, 1983) A 0.75 Kb segment containing the origin of 
replication from the RK2 plasmid (ori-V) (Stalker et al. t 1981) A 
3.0 Kb Sail to PatI segment of pBR322 which provides the origin 
of replication for maintenance in E. coli (ori-322) and the bom 

5 site for the conjugational transfer into Agrobacterium 
tumefaciens cells. A. 0.93 Kb fragment isolated from 
transposon Tn7 which encodes bacterial 
spectinomycin/streptomycin resistance (Spc/Str) (Fling et al. t 
1985), and is a determinant for selection in E. coli and 

10 Agrobacterium tumefaciens. A 0.36 Kb Pvul to Bell fragment 
from the pTiT37 plasmid, which contains the nop aline* type T- 
DNA right border region (Fraley et al. t 1985). An expression 
cassette consisting of the 0.6 Kb 35S promoter from the figwort 
mosaic virus (P-FMV) (Gowda et al., 1989), several unique 

X5 cloning sites, and the 0.7 Kb 3* nontranslated region of the pea 
rbcS-E9 gene (£9 3') (Coruzzi et al., 1984, and Morelli et al., 
1985). The CTP2- synthetic glyphosate oxidoreductase fusion 
fragment was cloned into this expression cassette. The 
introduction of this plasmid into Agrobacterium and 

20 subsequent plant transformation is described in the Examples 
to follow. 

Those skilled in the art will recognize that various 
chimeric constructs can be made which utilize the functionality 
of a particular CTP to import the contiguous glyphosate 
25 oxidoreductase enzyme into the plant cell chloroplast. The 
chloroplast import of the glyphosate oxidoreductase can be 
determined using the following assay. 

Chloroplast Uptake Assay 
30 Intact chloroplasts are isolated from lettuce (Latuca 

sativa, var. longifolia) by centrifugation in Percoll/ficoll 
gradients as modified from Bartlett et al. (1982). The final pellet 
of intact chloroplasts is suspended in 0.5 ml of sterile 330 mM 
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sorbitol in 50 mM Hepes-KQH, pH 7.7, assayed for chlorophyll 
(Arnon, 1949), and adjusted to the final chlorophyll 
concentration of 4 mg/ml (using Borbitol/Hepes). The yield of 
intact chloroplasts from a single head of lettuce is 3-6mg 
5 chlorophyll. 

A typical 300 ul uptake experiment contained 5 mM 
ATP, 8.3 mM unlabeled methionine, 322 mM sorbitol, 58.3 mM 
HepeB-KOH (pH 8.0), 50 \A reticulocyte lyBate translation 
products, and intact chloroplasts from L. sativa (200 ug 
^ chlorophyll). The uptake mixture is gently rocked at room 
■ temperature (in 10 x 75 mm glass tubes) directly in front of a 
fiber optic illuminator set at maximum light intensity (150 Watt 
bulb). Aliquot samples of the uptake mix (about 50 ul) are 
removed at various times and fractionated over 100 ul silicone- 
15 oil gradients (in 150 ul polyethylene tubes) by centrifugation at 
11,000 X g for 30 seconds. Under these conditions, the intact 
chloroplasts form a pellet under the silicone-oil layer and the 
incubation medium (containing the reticulocyte lysate) floats on 
the Burface. After centrifugation* the silicone-oil gradients are 
20 immediately frozen in dry ice. The chloroplast pellet is then 
resuspended in 50*100 ul of lysis buffer (10 mM Hepes-KOH pH 
7.5, 1 mM PMSF, 1 mM benzamidine, 5 mM £- amino -n-caproic 
acid, and 30 ugftnl aprotinin) and centrifuged at 15,000 X g for 
20 minutes to pellet the thylakoid membranes. The clear 
supernatant (stromal proteins) from this spin, and an aliquot of 
the reticulocyte lysate incubation medium from each uptake 
experiment, are mixed with an equal volume of 2X SDS-PAGE 
sample buffer for electrophoresis (see below). 

SDS-PAGE is carried out according to Laemmli (1970) 
in 3-17% (w/v) acrylamide slab gels (60 mm X 1.5 mm) with 3% 
(w/v) acrylamide stacking gels (5 mm X 1.5 mm). The gel is 
fixed for 20-30 minutes in a solution with 40% methanol and 
10% acetic acid. Then, the gel is soaked in EN3HANCE™ 
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(DuPont) for 20-30 minutes, followed by drying the gel on a gel 
dryer. The gel is imaged by autoradiography, using an 
intensifying screen and an overnight exposure to determine 
whether the glyphosate oxidoreductase is imported into the 
5 isolated chloroplasts. 

Alternative Isolation Protocol for Other Glyphosate 
Qridoreductase Structural, flenea 

A number of other glyphosate oxidoreductase genes 

10 have been identified and cloned, including the second LBAA 
glyphosate oxidoreductase gene from the Class II cosmid 
pMON7477. The gene was located, by Southern hybridization, 
on the -23 kb H in dill fragment, discussed in the cloning 
section above, using the first glyphosate oxidoreductase gene as 

15 a probe. Southern analysis also showed Pstl and Bglll 
hybridizing bands of -3.5 and -2.5 kb, respectively. The Bglll 
fragment from pMON7477 was subclone d into the SamHI site of 
pBlueScript vector. A clone in E. coli JM101 (pMON7482), in 
which the cloned fragment was oriented relative to the lac 

20 promoter as in pMON7469#l, was induced with IPTG and 
assayed for glyphosate oxidoreductase activity. In this 
experiment a Sp. Act. of -93 mnol/min.mg was obtained. In a 
later experiment, Class I and Class II cosmids were also 
isolated following infection of E. coli JM101 with the same 

25 packaged cosmid preparation and selection, directly for 
glyphosate tolerance at 3-5 mM glyphosate on M9 media. 

A glyphosate oxidoreductase gene has also been 
subcloned from another microbial isolate, identified originally 
by its ability to utilize glyphosate as a phosphorous source and 

30 later shown to contain a putative glyphosate oxidoreductase 
gene by hybridization with the LBAA glyphosate oxidoreductase 
gene probe. This gene was cloned initially in a T7 promoter 
cosmid by screening for glyphosate tolerance in E. coli 
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HB101/pGPl-2 (Boyer and^Rolland-Dussoix, 1969; Tabor and 
Richardson, 1985) on M9 medium containing glyphosate at 3 
mM. The presence of the glyphosate oxidoreductase gene was 
first indicated by a positive hybridization signal with the LBAA 
gene and by its location on a 2.5 kb Bglll fragment. This Bglll 
fragment was cloned into the 2? am HI site in pBlueScript 
(pMON17183) and expressed from the lac promoter by addition 
of IPTG. In this experiment a glyphosate oxidoreductase with 
a specific activity of 53 nmoles/min.mg was obtained, 
confirming the isolation of the gene by this Btrategy. The 
following features have usually been found for these glyphosate 
oxidoreductase genes: the genes are found (by Southern 
hybridization using full-length glyphosate oxidoreductase gene 
probes) on -2.5 kb Bglll fragments, on -3.5 Pstl fragments, 
contain one EcoRI site within the gene and the genes do not 
contain a Hmdlll site. The following schematic diagram 
illustrates some common features of these genes. 



GOX coding region 

The high degree of similarity of glyphosate 
oxidoreductase genes also suggests another way by which new 
glyphosate oxidoreductase genes may be cloned. The apparent 
conservation of regions flanking the genes and the absence of 
certain restriction sites suggests the use of single- stranded 
oligonucleotide probes to the flanking regions, containing 
restriction sites for Bglll, Hindlll, Pstl, BamHI, Ndel, or 
other suitable cloning sites, and PCR (Polymerase Chain 
Reaction; see Erlich, 1989, for complete details on PCR and its 
applications) to amplify a glyphosate oxidoreductase gene 
fragment suitable for cloning. The flanking sequences for 119 
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bp upstream (SEQ ID N0;11) of the wild-type (LBAA isolate) 
glyphosate oxidoreductase gene and for -290 bp (SEQ ID N0:12) 
downstream of the gene are provided in Figure 2. 

Using this PGR approach, glyphosate oxidoreductase 
genes from a number of sources have been isolated. The 
presence of the glyphosate oxidoreductase activity was 
confirmed by cloning the glyphosate oxidoreductase gene from 
chromosomal DNA prepared from Pseudomonaa sp. strain LBr 
(Jacob et al. t 1988) and using primers homologous to the N- and 
C-termini of the LBAA glyphosate oxidoreductase gene and 
containing the following suitable restriction cloning sites: 

5 • -GAGAGACTGT CGACTCCGCG GGAGCATCAT ATG-3 • (SEQ ID NO: 13) 
and 5 * -GAACGAATCC AAGCTTCTCA CGACCGCGTA AGTAC-3 1 (SEQ ID 

no : 14) . Cyclotherm parameters used for these PCR reactions is 
as follows: 

Denature at 94° C for 1 minute; 

Anneal at 60° C for 2 minutes; 

Polymerize at 72° C for 3 minutes, 

30 cycles, no autoextension, linked to 4° C incubation. 
The expected -1.3 kb PCR produced was generated and 
following digestion with Ndel and HindUl, this fragment was 
cloned into pMON2123 for expression of the encoded enzyme. 
The glyphosate oxidoreductase activity was measured as 
described above and the Kn, for glyphosate was similar to that 
for enzymes from LBAA which is presented aupra . 

source of glyphosate KpfglYPhOfiatfi; °lM^ 

oTidoreductase gene 

Pseudomonos sp. strain LBr 25 

Bacteria isolated from glyphosate process waste 
stream treatment facilities may also be capable of converting 
glyphosate to AMP A. Pseudomonas strains LBAA and LBr are 
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two such examples. Such bacteria may also be isolated de novo 
from these waste treatment facilities. 

A population of bacteria was isolated from a fixed-bed 
immobilized cell column, which employed Mannville R-635 
diatomaceous earth beads, by plating on Tryptone Soy Agar 
(Difco), containing cycloheximide at 100 ug/ml, and incubating 
at 28°C. The column had been run for three months on a waste- 
water feed from the Monsanto Company's Luling, MS, 
glyphosate production plant. The column contained 50 mg/ml 
glyphosate and NH 3 as NH 4 C1, Total organic carbon was 300 
mg/ml and BOD's (Biological Oxygen Demand - a measure of 
"soft" carbon availability) was less than 30 mg/ml. This 
treatment column has been described ( Heitkamp et al. v 1990). 
One of the predominant members of this population, identified 
as Agrobacterium sp. strain T10, was found to also grow in 
minimal broth in which the sole carbon source provided was 
glyphosate at 10 mM (this broth was made up as for DF 
medium but with glyphosate substituting for the glucose, 
gluconate and citrate). Chromosomal DNA was prepared from 
this isolate and subjected to the same PCR procedure and with 
the same primers as described above for the strain LBr. A 
fragment of the correct size was generated and cloned into the 
E. coli expression vector. The glyphosate oxidoreductase 
activity was assayed and the K m for glyphosate also determined: 

p onrcft of gene KuifplYPhftflf***: ™ M> > 

Agrobacterium sp. strain T10 28 

Glyphosate- to-AMP A conversion has been reported 
for many different soils (see Malik et al„ 1989 for a review) and 
a number of procedures are available for the xtraction of total 
DNA from mixed environment samples such as soil (Holben et 
al., 1988; Steffan and Atlas, 1988; Tsai and Olson, 1991), 
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indicating the possibility of cloning glyphosate oxidoreductase 
genes without having to first isolate such a degrading 
microorganism. Of course, the procedure described for the 
cloning of the glyphosate oxidoreductase genes, based on the 
5 conferring of a glyphosate utilization ability or glyphosate 
tolerance on E. coli, provides a scheme by which other 
glyphosate oxidoreductase genes and other glyphosate 
metabolizing genes may be cloned, without relying on the 
homology determined for the glyphosate oxidoreductase gene 
10 described here. It is possible also to enrich for glyphosate 
degrading bacteria, for example, by the repeated application of 
glyphosate to a patch of soil (Quinn et al., 1988, Talbot et al., 
1984). This enrichment step might be used to increase the ease 
with which glyphosate oxidoreductase genes are recovered 
15 from soil or other environments. 

Evidence for the presence of the glyphosate 
oxidoreductase gene in soil bacteria and a procedure for the 
isolation of such genes is outlined in the following: A 
population of suitable bacteria was enriched for selection of 
20 bacteria capable of growing in liquid media with glyphosate (at 
10 mM) as a source of carbon (This medium is made up as 
described for the Dworkin-Foster medium but with the 
omission of the carbon sources and with Pi as a source of P). 
The inoculum was provided by extracting soil (from a recently 
25 harvested soybean field in Jerseyville, Illinois) and the 
population selected by successive culturing in the medium 
described above at 28°C (cycloheximide was included at 100 
ugfml to prevent growth of fungi). Upon plating on L-agar 
medium, 5 colony types were identified. Chromosomal DNA 
30 was prepared from 2 ml L-broth cultures of these isolates and 
the presence of the glyphosate oxidoreductase gene was probed 
using PGR screening. Using the primers 
GCCGAGATGACCGTGGCCGAAAGC <SEQ ID NO:15) and 
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GGGAATGCCGGATGCTTCAACGGC (SEQ ID NO: 16), a DNA 
fragment of the predicted Bize was obtained with the 
chromosomal DNA from one of the isolates (designated S3). 
The PCR conditions used were as follows: 1 minute at 94°C; 2 
minutes at 40°C; 3 minutes at 72°C; 35 cycles. The DNA 
fragment generated in this way is used as a probe (following 
radiolabeling) to isolate the S3 glyphosate oxidoreductase gene 
candidate from a cos mid bank constructed as described for 
LBAA DNA and greatly facilitates the isolation of other 
glyphosate oxidoreductase genes. The primers used are 
homologous to internal sequences in the LBAA glyphosate 
oxidoreductase gene. The PCR conditions employed allow a fair 
degree of miff^ n ^ in the primers and the result suggests that 
the glyphosate oxidoreductase gene from S3 may not be as 
closely related to the other glyphosate oxidoreductase genes that 
were successfully isolated using the primers to the N- and C- 
termini of the LBAA gene. 

A variety of procedures are available for the isolation 
of genes. Some of these procedures are based on the knowledge 
of gene function that allow the design of phenotypic screens to 
aid in the isolation. Others are based on at least partial DNA 
sequence information that allow the use of probes or primers 
with partial or complete homology, or are based on the use of 
antibodies that detect the gene product. All of these options 
may be applied to the cloning of glyphosate oxidoreductase 
genes. 

Improvemen t ftf * h * Kinetic Properties of Glvphoaatfi 
Qjp HftTfrrlnctaae 

Prior examples of engineered herbicide resistance by 
enzymatic inactivation of the herbicide have utilized enzymes 
with an ability to bind and metabolize the herbicides much 
more efficiently than -glyphosate oxidoreductase metabolizes 
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glyphosate. The glyphosate oxidoreductase enzyme ha9 a K m for 
glyphosate of 20—30 mM and, as a result, the reaction rate for 
the degradation of glyphosate may be enhanced for optimal 
efficiency in transgenic plants by either lowering the K m or by 
raising the V max . 

Random mutagenesis techniques coupled with 
appropriate selections and/or screens are powerful tools which 
have been used successfully to generate large numbers of 
mutagenized gene sequences and potential variants. The same 
approaches may be used to isolate and to identify glyphosate 
oxidoreductase variants with improved glyphosate degradation 
efficiency. The mutagenesis techniques that may be employed 
include chemical mutagenesis of bacterial cultures containing 
the gene of interest or of purified DNA containing this gene and 
PCR methods used to generate copies of the gene (or portions of 
it) under conditions that favor misincorpo ration of nucleotides 
(errors) into the new strand: An example of such a condition 
would be carrying out the PCR reaction in the presence of Mn+*. 

Appropriate in vivo screens for improved variants 
following the mutagenesis could include those for improved 
glyphosate tolerance in E. eoli or increased growth on 
glyphosate in Mpu* strains. For the screen, the glyphosate 
oxidoreductase gene is cloned into a vector containing a weak 
bacterial promoter and/or in a replicon with a low copy 
number. The glyphosate tolerance phenotypes of different 
glyphosate oxidoreductase constructs have been shown to vary 
over a range of glyphosate concentrations and to correlate with 
the level of glyphosate oxidoreductase expression. For example, 
under uninduced conditions, PZac-glyphosate oxidoreductase 
vectors express less glyphosate oxidoreductase than PrecA- 
glyphosate oxidoreductase vectors and also display lower 
glyphosate tolerance. The mutagenized gene fragment is 
cloned into the most suitable vector and the resultant library 
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screencd. Variants are selected for their ability to grow at 
glyphosate level9 which inhibit growth of the control strain 
containing the parent glyphosate oxidoreductase clone. 
Glyphosate oxidoreductase activity confers on E. coli the ability 

5 to convert glyphosate to AMPA and, in suitable E. coli strains, 
this AMPA can provide a source of phosphate following 
cleavage of the C-P bond by C-P lyase. Suitable E. coli strains 
are B strains or Mpu* derivatives of K strains. The glyphosate 
oxidoreductase gene confers minimal growth on glyphosate as 

10 the sole phosphorus source in strain E. coli JM101 Mpu+ (= 
GB993). The growth rate on glyphosate has been shown to also 
correlate with the glyphosate oxidoreductase expression level. 
The xnutagenized glyphosate oxidoreductase gene is cloned into 
the appropriate vector and the variant library screened by 

15 differential growth rates on plates or by culturing in media 
containing glyphosate as sole phosphorous source. Clones 
which demonstrate faster growth on plates relative to the 
control strain are subsequently re-screened by growth curve 
analysis. 

2Q Glyphosate oxidoreductase variants which have been 

identified in each selection/screen are cloned into a vector for 
high-level expression and subjected to enzyme analysis to 
determine Km and V m *x values for glyphosate. The best 
glyphosate oxidoreductase variants are purified for complete 

25 kinetic characterization. Glyphosate oxidoreductase variants 
which have been identified with lower Km values and similar or 
higher V mmx values than wild-type enzyme values are analyzed 
by nucleic acid sequencing to determine the mutation(s). The 
goal in isolating variants would be to increase the kcat/Km ratio 

30 for glyphosate oxidoreductase-catalyzed glyphosate 
degradation. 

A variant with such improvements was isolated. 
The mutagenesis procedure used was that of Mn++-poisoned 
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PCR and the template was a linearized glyphosate 
oxidoreductase gene plasmid containing the synthetic 
glyphosate oxidoreductase gene (SEQ ID NO:8). The 
oligonucleotide primers used were homologous to regions in the 
5 vector and flanking the glyphosate oxidoreductase gene. The 
PCR conditions employed were as follows: 1 minute at 94°C, 2 
minutes at 55°C, and 3 minutes at 72°C and with 35 cycles. A 
5:1 ratio of dCTP+dGTP+TTP to dATP was used. The reactions 
contained MnCb at 125, 250, 375, or 500 uM. After the reaction, 
10 the amplified product was recloned into a vector cont ainin g a 
weak E. coli promoter. This vector was a pBR327 derivative 
containing the araBAD promoter and suitable cloning sites. 
One hundred colonies from this cloning step were then 
screened in E. coli GB993 for improved glyphosate tolerance and 
15 utilization phenotypes in media composed of MOPS minim al 
medium with glyphosate : and Pi or with glyphosate alone, 
respectively. Growth rates were determined by measuring Asso 
over a 96 hour period. Three clones were identified that 
exhibited faster growth rates in these screens. These 
20 transformants had a 1.5-2.0-fold faster utilization phenotype. 
The glyphosate oxidoreductase gene was recloned into the 
expression vector portion and this phenotype verified. All 
kinetic analysis was performed on crude E.coli lysates. 
Putative glyphosate oxidoreductase variant proteins were 
25 overexpressed after subcloning the NcoVHindlll variant 
glyphosate oxidoreductase gene into TrecA-genelOL expression 
vector. For overexpression in PrecA-genelOlj constructs, GB993 
cells containing the vector were induced at a Klett=110-120 in 
M9 TT.ir.imnl medium with 50 ug/ml nalidixic acid and allowed 
30 to grow for 2.5 hours at 37°C with vigorous shaking. Cells were 
harvested by centrifugation at 4000g, 5 minutes at 4°C, and 
resuspend in 100 mM Tris-HCl, pH 7.1. 1 mM EDTA, 35 mM 
KC1, 20% glycerol, and'l mM benzamidine at 3ml/g cell pellet. 
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LyBates were prepared by breaking the cells in a French press, 
twice, at 1000 psi. Insoluble debris was removed by 
centrifugation at 12000g, 15 minutes at 4°C, and the 
supernatant was de-salted by passing over a PD-10 column 
(Sephadex G-25, Pharmacia). The void volume fraction was 
used as the source of enzyme for kinetic Analysis. Protein 
concentrations were determined using the B:c-JcJ ad protein dye- 
binding assay. Time and enzyme concentration courses were 
performed to determine linear ranges. The enzyme assay was 
performed as follows: lysate' and glyphosate oxidoreductase mix 
(final concentration = 0.1 M MOPS, 0.01 M Tricine, pH 7.4, 0.01 
mM FAD, 10 mM MgCl 2 ) in a 100 ul reaction were pre- 
incubated at 30°C for 2 minutes prior to the addition of 
glyphosate (analytical grade stock prepared in water adjusted 
to pH 7.0 with NaOH). Ten minutes was determined to be the 
optimal time for the enzyme assay using 10 ug lysate. After 10 
minutes at 30°C with shaking, 0.25 ml dinitophenylhydrazine 
(DNPH) reagent (0.5 mg/ml in 0.5 M HC1) was added and the 
reaction was allowed to proceed for an additional 5 minutes at 
30°C with shaking. A 1.5 M NaOH solution <400ul) was then 
added to the assay mix, and the reaction was continued for 5 
minutes at 30°C with shaking. Enzyme activity was 
determined from the amount of glyoxylate-DNPH adduct 
formed by measuring A520 against a standard of glyoxylate. 
Enzyme assays are performed in duplicate on at least two 
different single colony isolates of a putative glyphosate 
oxidoreductase variant. To determine K m and V ma xt enzyme 
assays were performed over a (0.2-2.0) x K m range of glyphosate 
concentrations. The K m and V mmx were determined from 
Lineweaver Burk, Eadie-Hofstee and hyperbolic kinetic plots. 
V mmx waB estimated after determining the amount of 
immunoreactive glyphosate oxidoreductase protein in lysates by 
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immunoblot analysis as described below. Immunoblot analysis 
was performed following SDS-PAGE and transfer of protein 
from the gel to nitrocellulose at 500 mA in a Hoeffer transfer 
apparatus in 25 mM Tris-HCl, 192 mM glycine containing 0.1% 

5 SDS and 25% methanol for 1-2 hours. After transfer, the 
nitrocellulose was incubated with 50 mM Tris-HCl, pH7.5, 0.9% 
NaCl, 0.01% Tween 20, 0.02% NaN 3 containing 2% bovine 
serum albumin at room temperature with shaking for at least 
30 minutes. After blocking, the same buffer containing a 

10 1:25,000 dilution of goat anti-glyphosate oxidoreductase 
antiserum was added and the filter was allowed to shake at 
room temperature for 45 minutes. After incubation with 
primary glyphosate oxidoreductase antibody, the filter was 
washed for 45 glyphosate oxidoreductase minutes in buffer 

15 without antibody; buffer containing a 1:5000 dilution of rabbit 
anti-goat alkaline phosphatase-conjugated second antibody 
(from Pierce) was added and the filter was incubated for 45 
minutes at room temperature with shaking. The filter was 
then washed in buffer without antibody for 30 minutes prior to 

20 addition of NBT and BCIP (Promega) to allow color 
development. Immunoreactive glyphosate oxidoreductase 
protein was also quantitated by dot blotting the lysate onto 
nitrocellulose and then processing the filter as described above, 
except that 1251-Protein G was used for detection. The amount 

25 of glyphosate oxidoreductase protein in lysates was determined 
by counting the dot and comparing the amount of radioactivity 
against a glyphosate oxidoreductase protein standard. One 
variant, v.247, showed a 3-4-fold higher specific activity for 
glyphosate oxidoreductase at 25 mM glyphosate and the 
30 immunoblot analysis indicated that this was not due to an 
elevated glyphosate oxidoreductese protein level. Subsequent 
assays indicated that this variant had a 10-fold lower K m for 
glyphosate than the wild type glyphosate oxidoreductase. In a 
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similar manner the K m for IDA was also determined and these 
data are presented below. 

Kinetic analysis of ylvphoHate oxidoreductaae variants: 
5 app Km (mM) app V m (U/mg) 

Variant Olyp IDA <">lvn IDA GlVP IDA 

wild type 27.0 2.8 0.8 0.5 .03 .18 

v.247 2.6 0.7 0.6 0.7 .23 1.0 

IQ The glyphosate oxidoreductase gene from v.247 was 

sequenced (SEQ ID NO: 17) and five nucleotide changes were 
found. These changes are described in the following as they 
relate to the codons: GCT to GCC (codon 43), no amino acid 
change; AGC to GGC (codon 84), Ser to Gly; AAG to AGG 

15 (codon 153), Lys to Arg; CAC to CGC (codon 334), His to Arg. 
and CCA to CCG (codon 362), no amino acid change. The 
amino acid sequence of the glyphosate oxidoreductase gene 
from v.247 is presented as SEQ ID NO:18. The importance of 
these different amino acid changes was determined initially by 

20 recloning the altered regions into wild type glyphosate 
oxidoreductase and determining the effect on glyphosate 
oxidoreductase activity and kinetics. This was accomplished by 
recloning the Ncol-Nhel fragment (contains codon 84), the 
Nhel-ApalA fragment (contains codon 153), and the ApoLI- 

25 HindJU fragment (contains codon 334), separately into the wild 
type gene. These glyphosate oxidoreductase genes were then 
expressed and the kinetic analyses performed. The data are 
presented below and indicate that the change that occur ed in 
the ApaLI-Hfndlll fragment (contains codon 334) was 

30 responsible solely for the alteration in the enzyme. 
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W«^fin nnolvaia nf Hnmmn switches 




Clone 


app K m f raMI 


aDT>V« ( XJ/me) 




wt (wlw2w3*) 


28.4 


0.65 


0.022 


v.247(vlv2v3**) 


2.1 


0.72 


0.34 


wlv2w3 


23.5 


0.62 


0.026 


wlv2v3 


2.1 


0.6 


0.28 


wlw2v3 


2.0 


0.75 


0.375 


vlw2v3 


2.6 


0.55 


0.21 


vlw2w3 


28.0 


0.75 


0.027 


vlv2w3 


26.7 


0.55 


0.021 


* wl=SER84; w2=LYS153; w3=HIS334 




** vl=GLY84; v2=*ARG153; v3=ARG334 





This result was confirmed and extended by repeating 
the His to Arg change at codon 334 and introducing other 
specific changes at this residue by site-directed mutageneses. 
The primers used are listed in the following: am - cgttctctac 

ACTCGTGCTC GTAAGTTGC (SEQ ID NO: 19); IdtA - CGTTCTCTAC 
ACTAAGGCTC GTAAGTTGC (SEQ ID NO: 20); fiia - CGTTCTCTAC 
ACTCAAGCTC GTAAGTTGC (SEQ ID NO: 21) ; and AlA - CGTTCTCTAC 

actgctgctc gtaagttgc (seq id no: 22) (These sequences are the 
antisense to those actually used). The presence of these 
changes was confirmed by sequencing the mutagenized 
glyphosate oxidoreductase genes and a kinetic analysis of the 
expressed glyphosate oxidoreductase enzymes was performed. 
The data are presented in the following and show that a 
number of substitutions are possible at this position and which 
result in an enzyme with altered kinetic properties. 
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TTinoHr nn.nlvQiq of glvnVmsate nTidoreductase variants: 




app Km (mM) 


appV m 


(U/mg) . 


V 




Variant film 


H2A 


Olvp 


IDA 


film 


H2A 


wild type 27.0 


2.8 


0.8 


0.5 


.03 


.18 


v.247 2.6 


0.7 


0.6 


0.7 


.23 


1.0 


ARG334 2.6 


0.5 


0.6 


0.6 


.23 


1.2 


LYS 334 9.9 


1.3 


0.7 


0.8 


.07 


.62 


GLN334 19.6 


3.5 


0.6 


0.7 


.03 


.20 


ALA 334 26.7 


3.5 


0.2 


0.2 


.007 


.057 



30 



Additonal mutageneses were performed to change 
the His334 residue to other amino acids. The primers to 
accomplish this and the new codon are listed in the following: 

XXB - CTCTACACTTGGGCTCGTAAGCTTCTTCCAGC (SEQ ID NO: 23); 
Tin - CTCTACACTATCGCTCGTAAGCTTCTTCCAGC (SEQ ID NO: 24) ; 
LftU - CTCTACACTCTGGCTCGTAAGCTTCTTCCAGC (SEQ ID NO: 25) ; and 
GlU - CTCTACACTGAAGCTCGTAAGCTTCTTCCAGC (SEQ ID N0:26) 

(These sequences are the antisense of those actually used; these 
primers also add a "silent" Hin dHI that facilitates the 
identification of the mutagenized progeny from the population). 
The GLU334 variant retains substantial glyphosate 
oxidoreductase activity, while the TRP334, ILE334, and LEU334 
variants retain much less activity. 

From the first generation variants, those with the 
highest kcat/Km ratio are preferably subjected to a second round 
of mutagenesis followed by subsequent screening and analysis. 
An- alternative approach would be to construct second 
generation glyphosate oxidoreductase variants by combining 
single point mutations identified in the first generation 
variants. 
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pj.ANT TRANSFORMATION 

Plants which can be made glyphosate tolerant by 
practice of the present invention include, but are not limited to, 
5 soybean, cotton, corn, canola, oil seed rape, flax, sugarbeet, 
sunflower, potato, tobacco, tomato, wheat, rice, alfalfa, lettuce, 
apple, poplar and pine. 

A double-stranded DNA molecule of the present 
invention ("chimeric gene") can be inserted into the genome of 
10 a plant by any suitable method. Suitable plant transformation 
vectors include those derived from a Ti plasmid of 
Agrobacterium tumefaciens , as well as those disclosed, e.g., by 
Herrera-Estrella (1983), Bevan (1984), Klee (1985) and EPO 
publication 120,516 (Schilperoort et al.). In addition to plant 
15 transformation vectors derived from the Ti or root-inducing (Ri) 
plasmids of Agrobacterium, alternative methods can be used to 
insert the DNA constructs of this invention into plant cells. 
Such methods may involve, for example, the use of liposomes, 
electroporation, chemicals that increase free DNA uptake, free 
20 DNA delivery via microprojectile bombardment, and 
transformation using viruses or pollen. 

The pMON979 plant transformation/expression 
vector was derived from pMON886 (described below) by 
replacing the neomycin phosphotransferase typell (KAN) gene 
25 in pMON886 with the 0.89 kb fragment containing the bacterial 
gentamicin-3-N-acetyltransferase type III (AAC(3)-III) gene 
(Hayford et al., 1988). The chimeric P-35S/AA(3)-III/NOS 
3* gene encodes gentamicin resistance which permits selection 
of transformed plant cells. pMON979 also contains a 0.95 kb 
30 expression cassette consisting of the enhanced CaMV 35S 
promoter (Kay et al., 1987); several unique restriction sites, and 
the NOS 3' end (P-En-CaMV35S/NOS 3'). The rest of the 
pMON979 DNA segments are exactly the same as in pMON886. 
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Plasmid pMON886 is made up of the following 
segments of DNA. The first is a 0.93 kb Aval to 
engineered-JEcoRV fragment isolated from transposon Tn7 that 
encodes bacterial spectinomycin/streptomycin resistance 
5 (Spc/Str). which is a determinant for selection in E. coli and 
Agrobacterium tumefaciens. This is joined to the 1.61 kb 
segment of DNA encoding a chimeric kanamycin resistance 
which permits selection of transformed plant cells. The 
chimeric gene (P-35S/KAN/NOS 3') consists of the cauliflower 
10 mosaic virus (CaMV) 35S promoter, the neomycin 
phosphotransferase typell (KAN) gene, and the 
3*-nontranslated region of the nopaline synthase gene (NOS 3*) 
(Praley et al., 1983). The next segment is the 0.75 kb oriV 
containing the origin of replication from the RK2 plasmid. It is 
!5 joined to the 3.1 kb Sail to Pvul segment of pBR322 (ori322) 
which provides the origin of replication for maintenance in 
E. coli and the bom site for the conjugational transfer into the 
Agrobacterium tumefaciens cells. The next segment is the 0.36 
kb Pi; a I to Bell from pTiT37 that carries the nopaline- type 
20 7-DNA right border (Fraley et al, 1985). 

The pMON981 plasmid contains the following DNA 
segments: the 0.93 kb fragment isolated from transposon Tn7 
encoding bacterial spectinomycin/streptomycin resistance 
[Spc/Str; a determinant for selection in E. coli and 
25 Agrobacterium tumefaciens (Fling et al., 1985)]; the chimeric 
kanamycin resistance gene engineered for plant expression to 
allow selection of the transformed tissue, consisting of the 0.35 
kb cauliflower mosaic virus 35S promoter (P-35S) (Odell et al., 
1985), the 0.83 kb neomycin phosphotransferase typell gene 
30 (KAN), and the 0.26 kb S'-nontranslated region of the nopaline 
synthase gene (NOS 3') (Fraley et al., 1983); the 0.75 kb origin of 
replication from the RK2 plasmid (oriV) (Stalker et al., 1981); 
the 3.1 kb Sail to PuuTsegment of pBR322 which provides the 



92041559 



^1992 DERWEJN 1 FOtfLUJAirui NS LjIU 

WO 92/00377 PCT/US91/04514 

-54- 

origin of replication for maintenance in E. coli (ori-322) and the 
bom site for the conjugational transfer into the Agrobacterium 
tumefaciens cells, and the 0.36 kb Pvul to Bell fragment from 
the pTiT37 plasmid containing the nopaline-type T-DNA right 

5 border region (Fraley et al.. 1985). The expression cassette 
consists of the 0.6 kb 35S promoter from the figwort mosaic 
virus (P-FMV) (Gowda et al., 1989) and the 0.7 kb 3' non- 
translated region of the pea rbcS-E9 gene (E9 3*) (Coruzzi et al., 
1984, and Morelli et al., 1985). The 0.6 kb Sspl fragment 

10 containing the FMV35S promoter (Figure 1) was engineered to 
place suitable cloning sites downstream of the transcriptional 
start site. 

The plant vector was mobilized into the ABI 
Agrobacterium strain. The ABI strain is the A208 

15 Agrobacterium tumefaciens carrying the disarmed Ti plasmid 
pTiC58 (pMP90RK) (Koncz and Schell, 1986). The Ti plasmid 
does not carry the T-DNA phytohormone genes and the strain is 
therefore unable to cause the crown gall disease. Mating of the 
plant vector into ABI was done by the triparental conjugation 

20 system using the helper plasmid pRK2013 (Ditta et al., 1980). 
When the plant tissue is incubated with the ABI::plant vector 
conjugate, the vector is transferred to the plant cells by the vir 
functions encoded by the disarmed pTiC58 plasmid. The vector 
opens at the T-DNA right border region, and the entire plant 

25 vector sequence may be inserted into the host plant 
chromosome. The pTiC58 Ti plasmid does not transfer to the 
plant cells but remains in the Agrobacterium. 
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When adequate production of the glyphosate 
oxidoreductase activity iB achieved in transformed cells (ci 
protoplasts), the cella (or protoplasts) are regenerated into 
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whole plants. Choice of methodology for the regeneration step 
is not critical, with suitable protocols being available for hosts 
from Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae 
(carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, 

5 etc.), Cucurbitaceae (melons and cucumber), Gramineae 
(wheat, rice, corn, etc.), Solanaceae (potato, tobacco, tomato, 
peppers) and various floral crops. See, e.g., Ammirato, 1984; 
Shimamoto, 1989; Fromm, 1990; Vasil, 1990. 

The following examples are provided to better 

10 elucidate the practice of the present invention and should not be 
interpreted in any way to limit the scope of the present 
invention. Those skilled in the art will recognize that various 
modifications, truncations, etc, can be made to the methods and 
genes described herein while not departing from the spirit and 

15 scope of the present invention. 

EXAMPLES 

fiTp™>g*inn. A ctivity and Phenotvoe of Glvphosate 
20 Q*idflrediir t»AA in Tr»r>«fnrmpd Plants 

The transformation, expression and activity of 
glyphosate oxidoreductase, and the glyphosate tolerance 
phenotype imparted to the plants by the glyphosate 
oxidoreductase genes, introduced into Nicotiana tabacum cv. 
25 "SamBun" and/or Brassica napus cv. Westar using the vectors 
PMON17073, pMON17032, pMON17065, pMON17066, 
pMON17138 f and pMON17164, is described, in the following 
exemplary embodiments. . Initial data in tobacco on the 
expression of the manipulated glyphosate oxidoreductase gene 
30 (SEQ ID NO:6) under the control of the En-CaMV35S promoter 
(see data on pMON17073 in Tables VHI and DC, for example) 
indicated only low levels of expression of glyphosate 
oxidoreductase. The transcription of the gene was confirmed in 
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the case of 3-4 plants by Northern and SI analysis but no 
glyphosate oxidoreductase protein could be detected (limit of 
detection in that assay was -0.01% expression level). Analysis 
of Ro plants following spray with 0.4 lb/acre (approximately 
0.448 kg/ha) glyphosate also showed only low levels of tolerance. 
Modification of the gene sequence (as described herein) resulted 
in improved expression in tobacco, as did the use of the FMV 
promoter and the use of a CTP fusion to the glyphosate 
oxidoreductase gene. For these reasons the majority of the data 
presented comes from transgenic plants derived using vectors 
containing these improved glyphosate oxidoreductase 
constructs. One set of experiments with the modified 
glyphosate oxidoreductase vector pMON17032 are presented in 
example 1 and a study of manipulated glyphosate 
oxidoreductase, synthetic glyphosate oxidoreductase, and CTP1- 
synthetic glyphosate oxidoreductase is presented in example 2. 
The transformation and expression of glyphosate 
oxidoreductase in canola is described in example 3. 



20 fiTTttTTiplft 1 

The tobacco leaf £ *c transformation protocol employs 
healthy leaf tissue about 1 jl ath old. After a 15-20 minute 
surface sterilization with 10% Clorox plus a surfactant, the 
leaves were rinsed 3 times in sterile water. Using a sterile 
25 paper punch, leaf discs are punched and placed upside down 
on MS104 media (MS salts 4.3 g/1, sucrose 30 gfl, B5 vitamins 
506X 2 ml/1, NAA 0.1 mg/1, and BA 1.0 mg/1) for a 1 day 
pre culture. 

The discs were then inoculated with an overnight 
30 culture of disarmed Agrobacterium ABI containing the subject 
vector that had been diluted 1/5 (ie: about 0.6 OD). The 
inoculation was done by placing the discs in centrifuge tubes 
with the culture. After-30 to 60 seconds, the liquid was drained 
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off and the discs were blotted between sterile filter paper. The 
discs were then placed upside down on MS104 feeder plates 
with a filter disc to co-culture. 

After 2-3 days of co-culture, the discs were 
transferred, still upside down, to selection plates with MS104 
media. After 2-3 weeks, callus formed, and individual clumps 
were separated from the leaf discs. Shoots were cleanly cut 
from the callus when they were large enough to distinguish 
from stems. The shoots: were placed on hormone-free rooting 
media (MSO: MS salts 4.3 g/1, sucrose 30 g/1, and B5 vitamins 
500X 2 ml/1) with selection. Roots formed in 1-2 weeks. Any leaf 
callus assays are preferably done on rooted shoots while still 
sterile. Rooted shoots were placed in soil and were kept in a 
15 high humidity environment (ie: plastic containers or bags). 
The shoots were hardened off by gradually exposing them to 
ambient humidity conditions. 

A total of 45 Kanamycin resistant pMON17032 tobacco 
lines were examined (Table V). 



10 
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TftTiT.TC v . Tfrro™«mftn of Modified OlvphoBate Qridorftductaae 

Ctenfl in Tnhacco 
(Rl Transgenics of pMON17032) 

^3 GlypfaoMte Recallusing 

• punt* fp ifrnM yWiMat^ Wftiitern Annlynin of Planta* 

+ +/• - + 
45 0 11 34 24 21 

30 *+ means 0.5 - 2 ng/50 ug protein 

- means <0.5 ng/50 ug protein 
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Leaf recallusing on plant tissue culture media 
indicated a low level of glyphosate tolerance (rated as a +/- 
phenotype) for at least 11 of theBe lines. At least 24 of these lines 
expressed a detectable level of glyphosate oxidoreductase in the 
range of 0.5 to 2 ng per 50 . ng of extractable protein. The 
glyphosate tolerance displayed in the leaf recallusing assay and 
the higher glyphosate oxidoreductase expression level indicate 
that the changes made to the glyphosate oxidoreductase coding 
sequences to make the modified glyphosate oxidoreductase gene 
(SEQ ID NO:7) had a marked effect on the ability of this gene to 
be expressed in plants. This same effect could also then be 
achieved by expressing the manipulated glyphosate 
oxidoreductase gene (SEQ ID NO:6) using stronger plant 
promoters, using better 3' polyadenylation signal sequences, 
optimizing the sequences around the initiation codon for 
ribosome loading and translation initiation, or by combinations 
of these or other expression or regulatory sequences or factors. 
The Rl progeny of a number of these lines, including those with 
the highest glyphosate oxidoreductase expression level (# f s 
18854 and 18848) were sprayed with glyphosate at rates of 0.4 
and 1.0 lb/acre (0.448 and 1.12 kg/ha, respectively) and 
vegetative performance rated over a period of four weeks (Table 
VD. 
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TABT,tt VT - T fth » ppo sp rflV Dflta fQr PMQN17032 Rl Plants 

Vegetative Score * 





Rate 
kg/ha 


7 Dava 


14 Dava 


28 Dava 


iuuuv 


0.448 


3 


3 


4 




1.12 


1 


1 


2 


18842 


0.448 


4 


6 


8 




1.12 


2 


3 


6 


18848 


0.448 


3 


4 


8 




1.12 


2 


2 


6 


18854 


0.448 


4 


7 


9 




1.12 


2 


5 


8 


18858 


0.448 


3 


4 


6 




1.12 


1 


2 


4 


18885 


0.448 


4 


5 


8 




1.12 


2 


1 


2 


18890 


0.448 


3 


6 


7 




1.12 


1 


2 


3 


Samsun 


0.448 


i 


1 


2 




1.12 


1 


1 


o 


* Vecretativo Score 










OsDead 










10 a No detectable effect 







Following an initial lag, and especially for those 
plants expressing the highest levels of glyphosate 
oxidoreductase, these lines showed vegetative glyphosate 
tolerance at both spray rates (that improved with time). 
Glyphosate oxidoreductase enzyme activity was determined for 
two of the pMON17032 lines (ffa 18858 and 18 ort l). Leaf tissue 
(lg) was harvested, frozen in liquid N 2 , and stored at -80°C prior 
to extraction. For extraction, leaf tissue was pulverized in a 
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mortar and pestle with liquid N 2 . To the powdered leaf tissue 
was then added 1 ml extraction buffer (100 mM TrisCl, pH 7.4, 1 
mM EDTA, 20% glycerol, 35 mM KC1, 1 mM benzamidine HC1, 
5 mM Na ascorbate, 5 mM dithiothreitol, and 1 mg/ml bovine 
serum albumin, 4°C), and the sample was further ground for 1 
minute. The resulting mixture was centrifuged for 5 minutes 
(high speed, Eppendorf) and the supernatant was treated with a 
saturated ammonium sulfate solution to give 70% final 
saturation (2.33 ml saturated solution/ml extract). The 
precipitated protein was collected by centrifugation as above, 
and the pellet was resuspended in 0.4 ml of extraction buffer. 
After centrifuging again to remove particulate matter, the 
sample was desalted using Sephadex G50 contained in a 1 ml 
syringe, equilibrated with extraction buffer, according to the 
method of Penefsky (1979). The desalted plant extracts were 
stored on ice, and protein concentrations were determined by 
the method of Bradford (1976). Glyphosate oxidoreductase 
reactions were carried out in duplicate for 60 minutes at 30°C in 
an assay mixture of 0.1 MOPS/0.01 tricine buffer, pH 7.4, 
containing 10 mM MgCl 2 , 0.01 mM flavin adenine trinucleotide 
(FAD, Sigma), and 1 mM ubiquinone Qo, (Sigma). Plant 
extracts (75 pi) were preincubated in the assay mixture for 2 
minutes, and reactions were then initiated by adding 
iminodiacetic acid (IDA, 20 ul) substrate to a final 
concentration of 50 mM (total assay volume was 0.2 ml). 
Reactions were quenched and derivatized as' described below. 
Control reactions omitting IDA and omitting plant extract were 
also performed. Glyoxylate detection was carried out using 2,4- 
dinitrophenylhydraadne (2,4-DNPH) derivatization and reverse 
phase high performance liquid chromotography (HPLC), using 
a modification of the method of Qureshi et al. (1982). 
Glyphosate oxidoreductase reactions (0.2 ml) were quenched 
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with 0.25 ml of DNPH reagent (0.5 mg/ml DNPH [Aldrich] in 
0.5 M HC1) and allowed to derivatize for 5 minutes at 25°C. The 
samples were then extracted with ethyl acetate (2 x 0.3ml) and 
the combined ethyl acetate extracts were extracted with 10% 
Na 2 C0 3 (0.3 ml). The NaaCOs phase was then washed once 
with ethyl acetate (0.2 ml) and the Na 2 C03 phase injected (100 
pj) on a Beckman Ultrasphere C18 IP HPLC column (5 \l 9 4.6 
mm x 25 cm) using an LKB GTi binary HPLC system with a 
Waters 990 photodiode array UV/VIS HPLC detector, via a 
Waters WISP HPLC autoinjector. The isocratic mobile phase 
was methanol-water-acetic acid (60:38.5:1.5) with 5 mM 
tetrabutylammonium phosphate (Pierce). The DNPH- 
glyoxylate peak (retention time =* 6.7 minutes) was detected at 
365 nm and compared to a glyoxylate standard (Sigma, 20 jiM 
in 0.2 ml) derivatized in exactly the same manner. 

TARLK Vn - QlvphnBftte oiridor eductase Activity of Transgenic 

Tnhnceo Plants 

Plant Specific Activity 

nmol/min mg 

Saxnsun 0 (not detectable) 

18881 0.039 
18858 0.018 



A series of transformed tobacco lines were derived 
30 using the -isogenic" glyphosate oxidoreductase vectors 
pMON17073 (manipulated glyphosate oxidoreductase) (SEQ ID 
NO:6), pMON17065 (synthetic glyphosate oxidoreductase) (SEQ 
ID NO:8), and pMO"N17066 (CTPl-synthetic glyphosate 
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oxidoreductase). By Western analysis (see Table VII below) of a 
number of these lines, the manipulated glyphosate 
oxidoreductase plants were found to express up to -0.5 ng 
glyphosate oxidoreductase per 50 \ig plant protein, the synthetic 
glyphosate oxidoreductase at levels from -0.5 - 2 ng per 50 )ig, 
and at levels from ~2 - 20 ng per 50 \ig for the CTP1- synthetic 
glyphosate oxidoreductase plants. 
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TABTJE VTTJ - fllvphnflnt* Oxidoreductase 
F/rnrossion in Tobacco 

PftTifitruet Plants Wftatern Rating 

pMON17073 21270 
(manipulated) 21281 
21286 



21929 



0 
0 
1 
1 



10 



15 



pMON17066 

(CTP1- 

synthetic) 



21237 
21830 
21845 
21872 
21889 
21891 



1 
0 
3 
3 
1 
0 



pMON17065 21199 0 
(synthetic) 21208 2 
21211 2 
20 21217 0 

21218 2 
21792 1 
21795 0 
21811 2 
25 Western rating scale per 50 |ig of protein: 

0 -.no detectable glyphosate ozidoreductase 

1 - <.5ng . * 

2 - .5ng - 2ng 

3 - >2ng 

30 ^ number of : primary transformants R 0 lines, 

expressing manipulated or synthetic glyphosate ozidoreductase 
or CTPl-synthetic glyphosate ozidoreductase, were sprayed 
with glyphosate at 0.4 lb/acre (0.448 kg/ha) and rated as before. 
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TABLE TX - Olvnhnsate Snrav Data; pMONl70fifi 
(CTPl-Glvphoaate Qxidoreductase) 
Tobacco CRq plants) 



10 



Ling 

Control A 
Control B 
Control C 



Western Rating 
o 
o 
o 



(Snrttv Rati* = 0.4 lh/acr»> (0 44fl Wp/hflt 



H 
0 
X 
1 



2&1 

0 no detectable 

0 gfrphosate 

1 oxidoreductaae 



15 



20 



25 



30 



22933 


1 


3 


1 


0 


(pMON17073) 


22741 


2 


2 


1 


9 


(pMON 17065) 


22810 


3 


3 


4 


6 


(pMON 17066) 


22825 


1 


2 


1 


1 


(pMON 17066) 




3 


10 


10 


10 


(PMON17066) 


22044 


3 


10 


10 


10 


(pMON17066) 


22854 


3 


9 


10 


10 


(PMON17066) 


22860 


3 


8 


10 


10 


(pMON17066) 


22880 


1 


; 3 


2 


9 


(PMON17066) 


22881 


2 


2 


0 


0 


(pMON17066) 


22886 


3 


9 


10 


10 


(pMON 17066) 


22887 


3 


9 


10 


10 


(PMON17066) 



Western rating scale 

(per 50 ug protein) 

0 s'no detectable gryphoaate oxidoreductase 

1 s <0.5ng 

2 = 0.5 - 2ng 

3 = >2ng 

# Vegetative score: 0 = dead; 

10 = no detectable effect 
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The synthetic glyphosate oxidoreductase line 
displayed a response similar to that noted for the modified 
glyphosate oxidoreductase Ri plants, in that there was some 
immediate glyphosate effects that were overcome with time, 
through the metabolism of the herbicide by glyphosate 
oxidoreductase to the derivatives AMP A and glyoxylate. Since 
the target of glyphosate (EPSP synthase) is located in the 
chloroplast, the activity of glyphosate oxidoreductase must be 
reducing the level of glyphosate within this organelle by 
removing the herbicide before it reaches the chloroplast. The 
CTPl-synthetic glyphosate oxidoreductase plants displayed a 
superior glyphosate tolerance in that these plants did not show 
much, if any, immediate glyphosate effects at the treated rate. 
In general, the treated tolerant plants also showed normal 
development, flowering and fertility. 

The CTPl-synthetic glyphosate oxidoreductase plants 
showed a markedly higher level of glyphosate oxidoreductase 
expression than that shown for the other glyphosate 
oxidoreductase constructs. This increased glyphosate 
oxidoreductase level could be due to enhancement of translation 
of the fusion or to sequestering of glyphosate oxidoreductase 
within the chloroplast and leading to a longer protein half-life. 
The higher level of glyphosate oxidoreductase and/or its 
location in the chloroplast can result in higher levels of 
glyphosate tolerance through rapid detoxification of glyphosate 
in the chloroplast. The presence of glyphosate oxidoreductase 
within the chloroplast has been confirmed. Five leaves from 
each of four plants (#22844, 22854, 22886, 22887), shown to be 
Western positive for glyphosate oxidoreductase, were 
homogenized in Waring blender in 0.9 L GR+ buffer (Bartlett, et 
al., 1982) for 3 X 3 seconds at high speed. The homogenate was 
filtered through 4 layers ofMiracloth and centrifiiged at 6,000 
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rpm in a GS-3 rotor. The pellet was resuspended in 4 ml total of 
GR+ buffer and placed on top of a 40/80% Percoll step gradient 
and spun at 9,500 rpm for 10 minutes. The intact chloroplasts 
(lower band) were washed once with GR- buffer (Bartlett, et al., 
1982) and centrifuged (up to 6,000 rpm with brake off). They 
were then resuspended in 300 ul 50 mM Hepes pH 7.7, 330 mM 
Sorbitol and lysed on ice using by soni cation (small probe, 30%-3 
microtip setting x 10 seconds). The debris was pelleted and the 
supernatant passed through a Sephadex G50 column into 50 
mM Hepes, pH 7.5. The soluble protein concentration was 2.4 
mg/ml. The enzyme assays were done as above using both 50 
mM IDA and 50 mM glyphosate as substrates (30 minute 
assays), but without the addition of 1 mM ubiquinone. 

Table TX - Olvphosate Oxidoreductase Activity in Isolated 
Chloronlast from Transgenic Tobacco 

Substrate SMCifiC Activity 

30 (nmoles/min.mg) 

Iminodiacetic acid 179 
Glyphosate 92 

A number of transformed lines of canola have been 
derived with vectors pMON17138 (CTPl-synthetic glyphosate 
oxidoreductase) and pMON17164 (CTP2-syrithetic glyphosate 
oxidoreductase) as follows. 



15 



30 



Material 

Seedlings of Brassica napus cv Westar went 
established in 2 inch (>- 5 cm) pots containing Metro Mix 350. 
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They were grown in a growth chamber at 24°C, 16/8 hour 
photoperiod, light intensity of 400 uEm-2 S ec-i (HID lamps). 
They were fertilized with Peters 20-10-20 General Purpose 
^ Special. After 2 1/2 weeks they were transplanted to 6 inch (~ 15 
cm) pots and grown in a growth chamber at 15/10°C day/night 
temperature, 16/8 hour photoperiod, light intensity of 800 uEm- 
2sec-i (HID lamps). They were fertilized with Peters 15-30-15 Hi- 
Phos Special. 

10 

Transformation/Selection/Regeneration 

Four terminal internodes from plants just prior to 
bolting or in the process of bolting but before flowering were 
removed and surfaced .sterilized in 70% v/v ethanol for 1 

^ minute, 2% w/v sodium hypochlorite fcr 20 minutes and rinsed 
3 times with sterile deioniz^d water. Stems with leaves 
attached could be refrigerated in moist plastic bags for up to 72 
hours prior to sterilisation. Six to seven stem segments were 
cut into 5mm. Uses with a Redco Vegetable Slicer 200 
maintaining orientation of basal end. 

The Agrobacterium was grown overnight on a rotator 
at 24°C in 2mls of Luria Broth containing 50mg/l kanamycin, 
24mg/l chloramphenicol and 100mg/l spectincmycin. A 1:10 
dilution was made in MS (Murashige and Skoog) media giving 

25 approximately 9x10* cells per ml. This was confirmed with 
optical density readings' at 660 mu. The stem discs (explants) 
were inoculated with 1.0ml of Agrobacterium and the excess 
was aspirated from the explants. 

The explants were placed basal side down in petri 

30 plates containing 1/10X standard MS salts, B5 vitamins, 3% 
sucrose, 0.8% agar, pH 5.7, 1.0mg/l 6-benzyladenine (BA). The 
plates were layered with 1.5ml of media containing MS salts, B5 
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vitamins, 3% sucrose, pH 5.7, 4.0mg/l p-chlorophenoxyacetic 
acid, 0.005mg/l kinetin and covered with sterile filter paper. 

Following a 2 to 3 day co-culture, the explants were 
transferred to deep dish petri plates containing MS salts, B5 
vitamins, 3% sucrose, 0.8% agar, pH 5.7, lmg/1 BA, 500mg/l 
carbenicillin, 50mg/l cefotaxime, 200 mg/1 kanamycin or 
175mg/l gentamicin for selection. Seven explants were placed 
on each plate. After 3 weeks they were transferred to fresh 
media, 5 explants per plate; The explants were cultured in a 
growth room at 25°C, continuous light (Cool White). 

Expression Assay 

After 3 weeks shoots were excised from the explants. 
Leaf recall using assays were initiated to confirm modification 
of Ho shoots. Three tiny pieces of leaf tissue were placed on 
re call using media containing MS Baits, B5 vitamins, 3% 
sucrose, 0.8% agar, pH 5.7, 5.0mg/l BA, 0.5mg/l naphthalene 
acetic acid (NAA), 500mg/l carbenicillin, 50mg/l cefotaxime 
and 200mg/l kanamycin or gentamicin or 0.5mM glyphosate. 
The leaf assays were incubated in a growth room under the 
same conditions as explant culture. After 3 weeks the leaf 
recall using assays were scored for herbicide tolerance (callus 
or green leaf tissue) or sensitivity (bleaching). 

Transplantation 

At the time of excision, the shoot stems were dipped 
in Rootone0 and placed in 2 inch (~ 5 cm) pots containing 
Metro-Mix 350 and placed in a closed humid environment. 
They were placed in a growth chamber at 24°C, 16/8 hour 
photoperiod, 400 uEm-isec-2(HID lamps) for a hardening-off 
period of approximately^ weeks. 
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The seed harvested from R 0 plants ia Ri seed which 
gives rise to R\ plants. To evaluate the glyphosate tolerance of 

5 an R 0 plant, its progeny are evaluated. Because an Rq plant is 
assumed to be hemizygous at each insert location, selfing 
results in maximum genotypic segregation in the Ri. Because 
each insert acts as a dominant allele, in the absence of linkage 
and assuming only one hemizygous insert is required for 

10 tolerance expression, one insert would segregate 3:1, two 
inserts, 15:1, three inserts 63:1, etc. Therefore, relatively few Ri 
plants need be grown to find at least one resistant phenotype. 

Seed from an R*> plant is harvested, threshed, and 
dried before planting in a glyphosate spray test. Various 

15 techniques have been used to grow the plants for Ri spray 
evaluations. Tests are conducted in both greenhouses and 
growth chambers. Two planting systems are used; - 10 cm pots 
or plant trays containing 32 or 36 cells. Soil used for planting is 
either Metro 350 plus three types of slow release fertilizer or 

20 

plant Metro 350. Irrigation is either overhead in greenhouses 
or subirrigation in growth chambers. Fertilizer is applied as 
required in irrigation water. Temperature regimes 
appropriate for canola were maintained. A sixteen hour 
photoperiod was maintained. At the onset of flowering, plants 

25 

are transplanted to -15 cm pots for seed production. 

A spray "batch** consists of several sets of Ri 
progenies all sprayed on the same date. Some, batches may also 
include evaluations of other than Ri plants. Each batch also 
30 includes sprayed and unsprayed non-transgenic genotypes 
representing the genotypes in the particular batch which were 
putatively transformed. Also included in a batch is one or more 
non-segregating transformed genotypes previously identified as 
having som resistance. 
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T wo- six plants from each individual R 0 progeny are 
not sprayed and serve as controls to compare and measure the 
glyphosate tolerance, as well as to assess any variability not 
induced by the glyphosate. When the other plants reach the 2-4 
leaf stage, usually 10 to 20 days after planting, glyphosate is 
applied at rates varying from 0.28 to 1.12 kg/ha, depending on 
objectives of the study. Low rate technology using low volumes 
has been adopted. A laboratory track sprayer has been 
calibrated to deliver a rate equivalent to field conditions. 

A scale of 0 to 10 is used to rate the sprayed plants for 
vegetative resistance. The scale is relative to the unsprayed 
plants from the same R 0 plant. A 0 is death, while a 10 
25 represents no visible difference from the unsprayed plant. A 
higher number between 0 and 10 represents progressively less 
damage as compared to the unsprayed plant. Plants are scored 
at 7, 14, and 28 days after treatment (DAT), or until bolting, and 
a line is given the average score of the sprayed plants within an 
20 Ro plant family. 

Six integers are used to qualitatively describe the 
degree of reproductive damage from glyphosate: 
0: No floral bud development 
2: Floral buds present, but aborted prior to 
25 opening 

4: Flowers open, but no anthers, or anthers fail 

to extrude past petals 
6: Sterile anthers 
8: Partially sterile anthers 
30 10: Fully fertile flowers 

Plants are scored using this scale at or shortly after 
initiation of flowering, depending on the rate of floral structure 
development. 



92041559 




WQ 92/00377 



PCT/US9 1/0451 4 



-71- 



Tables X and XI below tabulate the vegetative and 
reproductive scores for canola plants transformed with 
pMON17138 (sprayed at a rate of 0.56 kg/ha and pMON17164 
(sprayed at a rate of 0.84 kg/ha), respectively. The results 
presented below illustrate the glyphosate tolerance conferred to 
canola plants as a result of expression of a glyphostate 
oxidreductase gene in the plants. 



10 



Table X 



Glvnhosat e finrav Evaluation of Canola Plants 
containing nMON17138 



15 



20 



Line name 



Batch O KB kg/ha score 0.56kp/ha arore 







14 DAT 


2&JQAX 






Vegetative 


Reproductive 


17138-22 


79 


9 


10 


17138-30 


79 


9 


10 


17138-145 


79 


10 


10 


17138-158 


79 


8 


10 


17138-164 


80 


8 


10 


Untransformed 


77 


3 


0 


Untransformed 


79 


1 


0 



25 



30 



n o n' » a r r n 
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Tflhlft YT ■ Olvphoaate Snrav Evaluation of Canola Plants 
containing pMQN17164 

Q,84 ke/ha score 
Construct Batch 14 DAT 28 DAT 

vegetative reproductive 
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17164-6 


82 


7 


10 


17164-9 


CO 


o 


ID 


1/ J.D4-ZU 


QO 


7 


in 


OC 


QQ 


8 


10 


17164-35 


84 


7 


10 


17164-45 


83 


9 


10 


17164-61 


83 


7 


10 


17164-75 


84 


7 


10 


17164-85 


84 


7 


10 


17164-97 


84 


6 


10 


17164-98 


83 


9 


10 


17164-105 


,83 


7 


10 


17164-110 


83 


9 


10 


17164-115 


83 


7 


10 


17164-129 


83 


8 


10 


17164-139 


84 


7 


10 


17164-140 


83 


8 


10 


17164-164 


83 


7 


10 


17164-166 


83 


8 


10 


17164-174 


83 


8 


10 


17164-186 


83 


3 


10 


17164-202 


83 


8 


10 


17164-218 


84 


6 


10 


17164-219 


83 


9 


10 


17164-222 


84 


7 


10 
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17164-225 


83 


; 7 


10 


17164-227 


BA 


7 


10 


17164-230 


83 


8 


10 


17164-243 


83 


7 


10 


17164-247 


84 


7 


10 


17164-287 


84 


7 


10 


17164-289 


83 


8 


10 


17164-300 


83 


9 


10 


17164-337 


83 


8 


10 



10 



The glyphosate oxidoreductase gene has also been 
introduced into and expressed in soybean and imparts 

^ glyphosate tolerance to such plants. The CTP2-synthetic 
glyphosate oxidoreductase fusion gene (as described above) was 
introduced into soybean under the control of the FMV promoter 
and with the NOS 3' sequences in vector pMON17159 t a map of 
which is presented in Figure 10. This vector consists of the 

^ following elements in addition to the glyphosate oxidoreductase 
gene sequences; the pUC origin of replication, an NPTII 
bacterial selectable marker gene (kanamycin) and the beta- 
glucuronidase gene (GUS; Jefferson et a. 1986) under the 
control of the E35S promoter and with the E9 3' sequences. The 

^ latter gene provides a scorable marker to facilitate the 
identification of transformed plant material. 

Soybean plants are transformed with pMON17159 by 
the method of microprojectile injection using particle gun 
technology as described in Christou et al. (1988). The seed 

30 harvested from R 0 plants is Ri seed which gives rise to Ri 
plants. To evaluate the glyphosate tolerance of an Ro plant, its 
progeny are evaluated.. Because an Ro plant is assumed to be 
hemizygous at each insert location, selfing results in 
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maximum genotypic segregation in the Ri. Because each 
insert acts as a dominant allele, in the absence of linkage and 
assuming only one hemizygous insert is required for tolerance 
expression, one insert would segregate 3:1, two inserts, 15:1, 
three inserts 63:1, etc. Therefore, relatively few Ri plants need 
be grown to End at least one resistant phenotype. 

Seed from an R© soybean plant is harvested, and 
IQ dried before planting in a glyphosate spray test. Seeds are 
planted into 4 inch (-5cm) square pots containing Metro 350. 
Twenty seedlings from each Ro plant is considered adequate for 
testing. Plants are maintained and grown in a greenhouse 
environment. A 12.5*14 hour photoperiod and temperatures of 
j5 30°C day and 24°C night is regulated. Water soluble Peters Pete 
Lite fertilizer is applied as needed. 

A spray "batch" ' consists of several sets of Ri 
progenies all sprayed on the same date. Some batches may also 
include evaluations of other than Ri plants. Each batch also 
20 includes sprayed and unsprayed non-transgenic genotypes 
representing the genotypes in the particular batch which were 
putatively transformed. Also included in a batch is one or more 
non-segregating transformed genotypes previously identified as 
having some resistance. 
25 One to two plants from each individual R 0 progeny 

are not sprayed and serve as controls to compare and measure 
the glyphosate tolerance, as well as to assess any variability not 
induced by the glyphosate. When the other plants reach the 
first trifoliolate leaf stage, usually 2-3 weeks after planting, 
30 glyphosate is applied at a rate equivalent of 128 oz./acTe 
(8.895kg/ha) of Roundup®. A laboratory track sprayer has been 
calibrated to deliver a rate equivalent to those conditions. 

A vegetative" score of 0 to 10 is used. Th score is 
relative to the unsprayed progenies from the same R G plant. A 
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0 is death, while a 10 represents no visible difference from the 
unsprayed plant. A higher number between 0 and 10 
represents progressively less damage as compared to the 
unsprayed plant. Plants are scored at 7, 14, and 28 days after 
treatment (DAT). 



10 



15 



Table XII - Qlvnhosa te Sprav Evaluation of Soybean Plants 

fflmtnining PMQN17159 



20 



Line Batch 

17159-24 14 

17159-25 14 

17159-28 14 

17159-40 14 

17159-43 14 

17159-71 14 

17159-77 14 

17159£1 15 

Untransformed * 14 



Score @ 8,895ke/ha. 2B DAT 
9 

6 

4 

4 

10 

9 

4 

0 



25 



30 



FiTamnlft ft 

The glyphoBate oxidoreductase gene has also been 
introduced into Black Mexican Sweet (BMS) corn cells with 
expression of the protein detected in callus. 

Plaomid pMON19632 was used to introduce the 
glyphosate oxidoreductase gene into corn cells. The backbone 
for this plasmid was constructed by inserting the 0.6kb 
cauliflower mosaic virus (CaMV) 35S UNA promoter (E35S) 
containing a duplication of the -90 to -300 region (Kay et al.» 
1987), a 0.58kb fragment containing the first intron from the 
maize alcohol dehydrogenase gene (Callis et al., 1987), and the 
3' termination sequences from the nopaline synthase (NOS) 
gene (Fraley et al., 1983) into pUC119 (Yanisch-Perron et al„ 
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1985), pMON19632 was formed by inserting the 1.7kb 
Bglll/EcoRI fragment from pMON17064 which contains the 
Arabidopsis SSU CTP fused to the synthetic glyphosate 
oxidoreductase coding sequence (SEQ IN NO:8>. 

Plasmid pMON 19632 was introduced into BMS corn 
cells by co-bombardment with EC9, a plasmid containing a 
sulfonylurea-resistant form of the maize acetolactate synthase 
gene. 2.5 ug of each plasmid was coated onto tungsten particles 
and introduced into log-phase BMS cells using a PDS-1000 
particle gun essentially as described in Klein et al., 1989. 
Transformants were selected on MS medium containing 20ppb 
chlorsulfuron. After initial selection on chlorsulfuron, the calli 
was assayed by glyphosate oxidoreductase Western blot. 

BMS callus (3 g wet weight) was dried on filter paper 
( Whatman# 1 ) under vacuum, reweighed, and extraction buffer 
(500 ul/g dry weight; 100 mM Tris, 1 mM EDTA, 10% glycerol) 
was added. The tissue was homogenized with a Wheaton 
overhead stirrer for 30 seconds at 2.8 powrr setting. After 
centrifugation (3 minutes, Eppendorf microfuge), the 
supernatant was removed and the protein was quantitated 
(BioRad Protein Assay). Samples (50 ug/well) were loaded on 
an SDS PAGE gel (Jule, 3-17%) along with glyphosate 
oxidoreductase standard (10 ng) ? electrophoresed, and 
transferred to nitrocellulose similarly to a previously described 
method (Padgett©, 1987). The nitrocellulose blot was probed 
with goat anti-glyphosate oxidoreductase IgG, and developed 
with 1-125 Protein G. The radioactive blot was visualized by 
autoradiography. Results were quantitated by densitometry on 
an LKB UltraScan XL laser densitomer and are tabulated below 
in Table Xm. 
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Tahte Xm 



Egression of glyphosate oxidoreductflfle in BMS 

Corn Callus using PMON19632 



10 



15 



EC9 (no GOX) 

T13-17 

T13-16 

T13-15 

T13-14 

T13-12 

H3-7 

T13-5 

T13-18 

T13-S 

T13-9 

T13A 



GQS expression 
f ft extracted protein) 

o 

0.016 

0.0065 

0.016 

0.003 

0.0079 

0.01 

0.004 

0.026 

0.019 

0.01 

0.027 



20 



Table Xm illustrates that glyphosate oxidoreductase 
can be expressed and detected in a xnonocotyledonous plant, 
such as corn. . 
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The glyphosate oxidoreductase gene may be used as a 
selectable marker for plant transformation directly on media 
containing glyphosate. The ability to select and to identify 
transformed plant material depends, in most cases, on the use 
of a dominant selectable marker gene to enable the preferential 
and continued growth of the transformed tissues in the 
presence of a normally inhibitory substance. Antibiotic 
resistance and herbicide tolerance genes have been used almoBt 
exclusively as such dominant selectable marker genes in the 
presence of the corresponding antibiotic or herbicide. The 
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nptll/kanamycin selection scheme is probably the most 
frequently used. It has been demonstrated that glyphosate 
oxidoreductase is also a useful and perhaps superior selectable 
marker/selection scheme for producing and identifying 
transformed plants. 

A plant transformation vector that may be used in 
this scheme is pMON17226 (Figure 11). This plasmid 
resembles many of the other plasmids described infra and is 
essentially composed of the previously described bacterial 
replicon system that enables this plasmid to replicate in E, coli 
and to be introduced into and to replicate in Agrobacterium, the 
bacterial selectable marker gene (Spc/Str), and located between 
the T-DNA right border and left border is the CTPl-glyphosate 
15 oxidoreductase synthetic gene in the FMV promoter-E9 3' 
cassette. This plasmid also has single sites for a number of 
restriction enzymes; lpcated within the borders and outside of 
the expression cassette. This makes it possible to easily add 
other genes and genetic elements to the vector for introduction 
into plants. 

The protocol for direct selection of transformed plants 
on glyphosate is outlined for tobacco. Explants are prepared for 
pre-culture as in the standard procedure as described in 
Example 1: surface sterilization of leaves from 1 month old 
25 tobacco plants (15 minutes in 10% clorox + surfactant; 3X dH20 
washes); explants are cut in 0.5 x 0.5 cm squares, removing leaf 
edges, mid-rib, tip, and petiole end for uniform tissue type; 
explants are placed in single layer, upside down, on MS104 
plates + 2 ml 4C005K media to moisten surface; pre-culture 1-2 
days. Explants are inoculated using overnight culture of 
Agrobacterium containing the plant transformation plasmid 
that is adjusted to a titer of 1.2 X 10* bacteria/ml with 4C005K 
media. Explants are placed into a centrifuge tube, the 
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Agrobacterium suspension is added and the mixture of bacteria 
and explants is "Vortexed" on maximum settting for 25 seconds 
to ensure even penetration of bacteria. The bacteria are poured 
5 off and the explants are blotted between layerB of dry sterile 
filter paper to remove excess bacteria. The blotted explants are 
placed upside down on MS104 plates + 2ml 4C005K media + 
filter disc. Co-culture is 2-3 days. The explants are transferred 
to MS104 + Carbenicillin 1000 mg/1 + cefotaxime 100 mg/1 for 3 
10 days (delayed phase). The explants are then transferred to 
MS104 + glyphosate 0.05 mM + Carbenicillin 1000 mg/1 + 
cefotaxime 100 mg/1 for selection phase. At 4-6 weeks shoots are 
cut from callus and placed on MSO + Carbenicillin 500 mgA 
rooting media. Roots form in 3-5 days , at which time leaf pieces 
15 can be taken from rooted plates to confirm glyphosate tolerance 
and that the material is transformed. 

The presence of ■ the glyphosate oxidoreductase 
protein in these transformed tissues has been confirmed by 
immunoblot analysis of leaf discs. The data from one 
20 experiment with pMON17226 is presented in the following: 25 
shoots formed on glyphosate from 100 explants inoculated with 
Agrobacterium ABI/pMON17226; 15 of these were positive on 
recallusing on glyphosate, and 19 of these were positive for 
glyphosate oxidoreductase protein as detected by immunoblot. 
25 These data indicate a transformation rate of 15-19 per 100 
explants, which makes this a highly efficient and time saving 
transformation procedure for plant. Similar transformation 
frequencies have been obtained with a pMON17226 derivative 
(pMON17241) containing the gene for the glyphosate 
30 oxidoreductase v.247 (SEQ ID NO:17). The glyphosate 
oxidoreductase gene has also been shown to enable direct 
selection of transformants in other plant species, including 
Arabidopsis, potato, and sugarbeet. 
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From the foregoing, it will be seen that this invention 
is one well adapted to attain all the ends and objects 
hereinabove set forth together with advantages which are 
obvious and which are inherent to the invention. 

It will be understood that certain features and 
subcombinations are of utility and may be employed without 
reference to other features and subcombinations. This is 
contemplated by and is within the scope of the claims. 

Since many possible embodiments may be made of 
the invention without departing from the scope thereof, it is to 
be understood that all matter herein set forth or shown in the 
accompanying drawings is to be interpreted as illustrative and 
not in a limiting sense. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION I 

(i) APPLICANT: Xiahore, Canaan M. 

Barry, Corard r. 

(11) TITLE OF INVENTION: Glyphooate Reeiatant Plant* 

(111) NUMBER OF SEQUENCES t 33 

(Iv) CORRESPONDENCE ADDRESS I 

(A) ADDRESSEE > Thomaa P. McBride, Monaanto Co. BB4F 

(B) STREETS 700 Chesterfield Villa?* Parkway 

(C) CITY i St. Lou I a 

(D) STATE: Mlaaourl 

(E) COUNTRY i USA 

(F) ZIPS 63198 

(v) COMPUTER READABLE FORMS 

(A) MEDIUM TYPES Floppy disk 

(B) COMPUTERS IBM PC compatible 

(C) OPERATING SYSTEMS PC— DOS /MS —DOS 

(D) SOFTWARE t Pat ant In Release #1*0, Version #1.25 

(vl) CURRENT APPLICATION DATAt 

(A) APPLICATION NUMBERS 

(B) PILING DATES 

(C) CLASSIFICATIONS 

(viii) ATTORNEY/ AGENT INFORMATIONS 

(A) NAME? McBride, Thomaa P. 

(B) REGISTRATION NUMBERS 32706 

(C) REFERENCE/DOCKET NUMBER: 38-21(10533) 

(lx) TELECOMMUNICATION INFORMATIONS 
(A) TELEPHONES (314)537-7357 



(2) INFORMATION FOR SEQ ID NOsl: 

(1) SEQUENCE CHARACTERISTICS} : 

(A) LENGTHS 564 base pairs 

(B) TYPES nuclaic acid 

(C) STRANDEDNESS s double 

(D) TOPOLOGY* linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
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ATTTAGCAGC ATTCCAGATT GCGTTCAATC AACAAGGTAC GACCCATATC ACTTTATTCA 
AATTGGTATC GCCAAAACCA AGAAGGAACT CCCATCCTCA AAGCTTTGTA AGGAAGAATT 
CTCAGTCCAA AGCCTCAACA AGGTC AGGG T ACAGACTCTC CAAACCATTA CCCAAAACCT 
ACAGGAGATC AATCAAGAAT CTTCAATCAA AGTAAACTAC TGTTCCAGCA CATGCATCAT 
CCTCAGTAAG TTTCAGAAAA ACACATCCAC CGAACACTTA AAGTTAGTGG GCATCTTTCA 
AAGTAATCTT GTCAACATCG AGCAGCTGCC TTGTGGGGAC CAGACAAAAA AGCAATGGTG 
C AG AATTGTT ACGCGCACCT ACCAAAAGCA TCTTTGCCTT TATTGCAAAA GATAAAGCAG 
ATTCCTCTAC TACAAGTGCC GAACAAAATA ACCTGGAAAA GAGCTGTCCT GACAGCCCAC 
TCACTAATGC GTATGACGAA CGCAGTGACG ACCACAAAAG AATTTTCCCT CTATATAAGA 
AGGCATTTCA TTCCCATTTG AAGG 
(2) INFORMATION FOR SEQ ID NO*2t 

<i) SEQUENCE CHARACTERISTICS l 

(A) LENGTH* 27 *>»•• pair* 

(B) TYFSs nuclaic acid 

(C) STRANDEDNESS* doubla 

(D) TOPOLOGY: linear : 

(ii) MOLBCDLB TYPE* DNA (ganomic) 

(xi) SEQUENCE DESCRIPTION SEQ ID NO*2s 
ATCATCAGAT ACTAACCAAT ATTTCTC 
(2) INFORMATION FOR SEQ ID NO*3* 

(t) SEQUENCE CHARACTERISTICS I 

(A) LENGTH i 1689 ba»« pair* 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS * double 

(D) TOPOLOGY: linear 

(Ii) MOLECULE TYPE* DNA (genomic) 



60 
12C 
180 
240 
300 
360 
420 

4ao 

540 
564 



27 



G20832 



(xi) SEQUENCE DESCRIPTION* SEQ ID NO* 3s 

NCATGGACGT CTGATCGAAA TCGTCGTTAC CCCAGCAAGG TAAGCCACCC CGAATTTTAT 
CACCTACCGC GAAACGGTGG CTAGGCAGCG ACAGACTCTC GGCTCCCCGG GACCATCCTA 
wtc^aSa CCACAAAAAA GTACGCATCC CTGGAGCCGC AATCCTCGGC CTATCCACGG 



60 
120 
180 



& 
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C0CT0A7OCT TCAQCQCCOC OOATTCAAA9 TCACCTTOAT TOACCCOAAC CCTCCTOOCO 240 

AAOOTOCATC OTTTOOOAAT OCO00ATQCT TCAAOOOCTC ATCOOTCOTC CCTATOTCCA 30O 

TOCCOOCAAA CTTGACOAOC CTOCCCAAOT OOCTCCTTCA CCCOATOOOC COTTOTCAAT 360 

CCOOTTCAGC TATTTCCAAC CATCATCCCT OOTXOATTC© CTTTCTOTTA OCCOOAAOAC 42C 

CAAACAAOCT OAAOOAOCAO OCOAAAOCAC TCCOCAATCT CATCAAOTCC ACOOTOCCTC 480 

TOATCAAGTC ATTGOCGOAO CAOOCTOATO COAOCCATCT OATCOQCCAT OAAOOTCATC 54 0 

TOACCCTATA TCOTOOAOAA OCAOACTTCO CCAAOOACOO COOAOOTTOO OAACTOOOOC 60 0 

CTCTCAACOO TOTTCOCACO CAOATCCTCA 4COCOOATOC OTTOOOOGAT TTCOATCCOA 660 

ACTTOTCOCA TCCCTTTACC AAGOGCATTC TXAIACAAOA QAACOCTCAC ACGATTAATC 720 

CCCAACCCCT COTOACCCTC TTOTTTCOOC OTTTTATCQC OAACOOTOOC OAATTCOTAT 7tt0 

CTCCCCCTOT CATCOGCTTT CAOACTOAAO 0TA0OOCOCT TAAAOOCATT ACAACCACOA ©40 

AC0OC 0 TTCT OOCOOTTOAT OCAOOOOTTO TCOCAOCOOO OOCAGACTOO AAATCACTTO »OD 

CTAATTCOCT AGGCOATOAC AT CC C O CTC O ATACCOAACQ TOOATATCAT ATCOTCATCO »60 

COAATCCGOA AOCCOCTCCA COCATTCCCA COACCOATOC OTCAOOAAAA TTCATOOCOA 1020 

CACCTATOGA AATOOQOCTT COCOTOOO00 OTACOOTTOA OTTCOCTOOO CTCACAOCCO 1000 

CTCCTAACTO OAAACOTQCO CATOTOCTCT ATACQCAOOC TOOAAAACTT CTTCCAOCCC 1140 

TOOCOCCTOC OAOTTCTOAA OAACOATATT CCAAATOOAT OOOOTTCCOO CCOAOCATCC 1200 

CCOATTCCCT CCCCOTOATT OOCOOOOCAA CCC OOA C A CC OGAOOTAATC TATOCTTTCO 1260 

CCCATCOTCA TCTCOOCATO ACAO000CGC COATOACOOC AACOCTCOTC TCACAOCTCC 1320 

TCCCACCCOA AAAOACCTC* ATOOACATTT COCCCWOOC ACCAAACCOC TTTOCTATTO 1200 

GCAAATCGAA OCAAACOOOT COOOCAA07T AAOTACTTAC OCOOTCOTOA 0TACA0COCA 1440 

gagcoootot gaaoatgam ctocaccsco gaatcacctc ooaoacocoa aatoccocaa 1S00 

ATAQAACACA TATTAAOOAO TCACOCCCCO AAOCCTTTOO OTCACTACAO TCAOOCOOCC 1560 

CGAGCGCCTO GATTCATTCA TOTTTCC00T CACCTTCCGA TCAAACCAOA AGOCCA0TC0 1*20 

CAGCAATCTO AOOATCTCOT CGATAACCAO OCCAOTCTCO TTCTCCOOAA TTTOCTOOCC 1680 

CTACTCCAO 1689 
(2) XMTOXftATXOtf TOK 8YQ 10 *Di4* 

(1) SSQUSHCS CHAJUfcCT*AX»TXC«> 



92041559 



•WO 92/00377 



PCT/US91/04514 



95 



(A) LENGTHS 1293 base pairs 

(B) TYPBs nucleic acid 

(C) STRANDEDNESS s double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURES 

(A) NAME/KEY: CDS 

(B) LOCATIONS 1..1293 



(xi) SEQUENCE DESCRIPTION SEQ ID NOs4s 

ATO TCT GAG AAC CAC AAA AAA GTA GGC ATC CCT CCA CCC CCA ATC GTC 
Met Ser Glu Asn' Hie Lye Lya Val Gly He Ala Gly Ala Gly He Val 

1 5 10 15 



GGC GTA TCC ACQ GCG CTG ATC CTT CAG CCC CCC GGA TTC AAA GTC ACC 
Gly Val Cye Thr Ala Leu Met Leu Gin Arg Arg Gly Phe Lye Val Thr 

20 25 30 



TTC ATT GAC CCC AAC CCT CCT GGC GAA GOT CCA TCC TTT GCG AAT GCC 
Leu XI* Asp Pro Asn Pro Pro Gly Glu Gly Ala Ser Phe Gly Asn Ala 

35 40 45 



GGA TCC TTC AAC GGC TCA TCC GTC GTC CCT ATC TCC ATO CCC GGA AAC 
Gly Cys Phe Asn Gly Ser Ser Val Val Pro Met Ser Met Pro Gly Asn 

50 55 ' 60 



TTG ACC AGC CTG CCC AAC TCC CTC CTT GAC CCG ATC GGC CCT TCT CAA 
Leu Thr Ser Val Pro Lys Trp Leu Leu Asp Pro Met Gly Arg Cys Gin 

65 70 75 80 



TCC GGT TCA CCT ATT TCC AAC CAT CAT CCC TCC TTG ATT CCC TTT CTG 
Ser Gly Ser Ala He Ser Asn . His His Ala Trp Leu He Arg Phe Leu 

85 90 95 



TTA CCC GGA AGA CCA AAC AAGGTG AAG GAG CAG GCG AAA CCA CTC CCC 
Leu Ala Gly Arg Pro Asn Lys Val Lys Glu Gin Ala Lys Ala Leu Arg 
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100 



105 



110 



AAT CTC ATC AAG TCC ACQ CTG CCT CTG ATC AAG TCA TTO CCG GAG GAG 
Asn Leu Ha Lye Sor Thr Val Pro Leu II? Lya Sar Leu Ala Glu Glu 



115 



120 



125 



384 



CCT GAT GCG AGC CAT CTG ATC CGC CAT GAA GOT CAT CTG ACC GTA TAT 
Ala Asp Ala Ser His Leu lie Arg His Glu Oly His Lau Thr Val Tyr 



130 



135 



140 



432 



CCT GGA GAA CCA CAC TTC GCC AAG GAC CGC GGA GGT TGG GAA CTC CCG 
Arg Gly Glu Ala Asp Pha Ala Lys Asp Arg Gly Gly Trp Glu Lau Arg 



145 



150 



155 



160 



480 



CCT CTC AAC GGT GTT CGC ACG GAG ATC CTC AGC GCC GAT GCG TTG CCG 
Arg Lau Aan Gly Val Arg Thr Gin II* Lau Sar Ala Asp Ala Lau Arg 



165 



170 



175 



528 



GAT TTC GAT CCG AAC TTO TCO CAT GCG TTT ACC AAC CGC ATT CTT ATA 576 
Aap Pha Aap Pro Aan Lau Sar Hla Ala Pha Thr Lya Gly I la Leu lie 

180 185 190 



GAA GAG AAC GGT CAC ACG ATT AAT CCG CAA GGO CTC CTC ACC CTC TTG 624 
Glu Glu Aan Gly His Thr lie Asn Pro Gin Gly Leu Val Thr Leu Leu 

195 '200 205 



TTT CCG CCT TTT ATC GCG AAC GGT GGC GAA TTC GTA TCT GCG CCT GTC 672 
Pha Arg Arg Phe Zle Ala Asn Gly Gly Glu Pha Val Sar Ala Arg Val 

210 215 220 

ATC GGC TTT GAG ACT GAA GGT AGG GCG CTT AAA GGC ATT ACA ACC ACG 720 
lie Gly Phe Glu Thr Glu Gly Arg Ala Leu Lya Gly lie Thr Thr Thr 

225 230 235 240 

AAC GGC CTT CTG GCC CTT GAT GCA GCG GTT GTC CCA GCC GGC CCA CAC 768 
Asn Gly Val Leu Ala Val Asp Ala Ala Val Val Ala Ala Gly Ala Bis 
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250 



255 



TCG XXX TCA CTT OCT AAT TCC CTA CCC CAT CAC ATC CCG CTC CAT ACC 
Ser Lys Ser Leu Ala Asn Ser Leu Gly Asp Asp He Pro Leu Asp Thr 



260 



265 



270 



616 



CAA CGT GGA TAT CAT ATC CTC ATC CCC AAT CCC CAA CCC CCT CCA CCC 
Glu Arg Gly Tyr His He Val He Ala Asn Pro Glu Ala Ala Pro Arg 



275 



2B0 



265 



664 



ATT CCQ ACO ACC GAT CCC TCA CCA AAA TTC ATC OCO ACA CCT ATC CAA 
He Pro Thr Thr Asp Ala Ser Cly Lya Phe I la Ala Thr Pro Mat Glu 



290 



295 



300 



912 



ATC CCC CTT CCC GTC GCC GGT ACC CTT GAG TTC CCT CCC CTC ACA CCC 
Mat Cly Leu Arg Val Ala Cly Thr Val Glu Pha Ala Gly Lam Thr Ala 



305 



310 



315 



320 



960 



CCT CCT AAC TCG AAA CGT GCC CAT CTC CTC TAT ACO CAC CCT CCA AAA 
Ala Pro Asn Trp Lys Arg Ala His Val Lau Tyr Thr Bis Ala Arg Lya 



325 



330 



335 



1008 



CTT CTT CCA GCC CTC GCC CCT GCC ACT TCT GAA CAA CCA TAT TCC AAA 
Lau Lau Pro Ala Lau Ala Pro Ala Sar Sar Glu Glu Arg Tyr Ser Lye 



340 



345 



350 



1056 



TCC ATC GCC TTC CCC CCC ACC ATC CCG GAT TCC CTC CCC CTC ATT CCC 
Trp Mat Cly Pha Arg Pro Ser Lie Pro Asp Sar Lau Pro Val I la Cly 



355 



360 



365 



1104 



CCC CCA ACC CCG ACA CCC GAC CTA ATC TAT CCT TTC CCC CAT CCT CAT 
Arg Ala Thr Arg Thr Pro Asp Val lie Tyr Ala Ph« Cly Bis Cly Bis 



370 



375 



380 



1152 



CTC GCC ATG ACA CGG GCC CCG ATG ACC CCA ACG CTC GTC TCA GAG CTC 
Lau Gly Met Thr Cly Ala Pro Met Thr Ala Thr Leu Val Ser Glu Leu 



1200 
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390 



98 - 
395 



400 



CTC CCA GCC GAA AAO ACC TCA ATC CAC ATT TCO CCC TTC CCA CCA AAC 
Leu Ala Gly Clu Lya Thr Sor II* Aap Ila Sar Pro Pha Ala Pro A»n 



405 



410 



415 



1248 



CGC TTT GGT ATT GGC AAA TCC AAG CAA ACQ GGT CCO CCA ACT TAA 
Arg Pha Gly I la Gly Lya Sar Lya Gin Thr Gly Pro Ala Sar 
420 425 430 



12*3 



(2> INFORMATION FOR SEQ ID NOs 5s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 430 amino acida 

(B) TYPES amino acid 
<D) TOPOLOGY s linaar 

(ii) MOLECULE TYPES protain 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NOt 5 J 

Met Sar Glu Aan His Lya Lya Va,l Gly Ila Ala Gly Ala Gly Ila Val 

1 5 10 15 

Gly Val cya Thr Ala Lau Hat Lau Gin Arg Arg Gly Pha Lya Val Thr 

20 . 25 30 

Lau Ila Aap pro Aan Pro Pro Gly Glu Gly Ala Sar Pha Gly Aan Ala 
35 40 45 

Gly Cya Pha Aan Gly Sar Sar Val Val Pro Mat Sar Mat Pro Gly Aan 

50 55 60 



Lau Thr Sar Val Pro Lya Trp Lau Leu Aap Pro Hat Gly Arg Cya Gin 
65 70 75 80 



Sar Gly Sar Ala Ila Sar Aan Hia Hia Ala Trp Lau Ila Arg Pha Lau 

85 90 95 
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Leu Ala Gly Arg Pro Aan Ly» Val Lye Glu Gin Ala Lys Ala Leu Arg 

100 105 no 

Asn Leu I la Lya sar Thr Val Pro Lau I la Lya Sar Leu Ala Glu Glu 

115 120 125 

Ala Aap Ala Ser His Lau Xla Arg Hia Glu Gly Hia Lau Thr Val Tyr 
130 135 140 

Arg Gly Glu Ala Aap Pha Ala Lya Aap Arg Gly Gly Trp Glu Lau Arg 

145 150 155 160 

Arg Lau Aan Gly Val Arg Thr Gin lie Leu Sar Ala Aap Ala Lau Arg 

165 170 175 

Asp Pha Aap Pro Aan Lau Sar Hia Ala Pha Thr Lya Gly I la Lau Zla 

180 165 190 

Glu Glu Aan Gly Bia Thr I la Aan Pro Gin Gly Lau Val Thr Lau Leu 

195 200 205 

Phe Arg Arg Pha Zla Ala Aan Gly Gly Glu Phe Val Sar Ala Arg Val 

210 215 220 

He Gly Pha Glu Thr Glu Gly Arg Ala Leu Lye Gly lie Thr Thr Thr 
225 230 235 240 

Aan Gly Val Lau Ala Val Aap Ala Ala Val Val Ala Ala Gly Ala Hia 
245 250 255 

Ser Lys Ser Leu Ala Asn Ser Leu Gly Asp Asp He Pro Leu Asp Thr 
260 265 270 
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Glu Arg Gly Tyr His lie Val lie Ala X»n Pro Glu Ala Ala Pro Arg 
275 280 2B5 

lie Pro Thr Thr Asp Ala Ser Gly Lye Phe lie Ala Thr Pro Met Glu 
290 295 300 

Met Gly Leu Arg Val Ala Gly Thr Val Glu Phe Ala Gly Leu Thr Ala 
305 310 315 320 

Ala Pro Asn Trp Lys Arg Ala Kia Val Leu Tyr Thr His Ala Arg Lye 

325 330 335 

Leu Leu Pro Ala Leu Ala Pro Ala Ser Ser Glu Glu Arg Tyr Ser Lye 
340 345 350 

Trp Met Gly Phe Arg Pro Ser lie Pro Asp Ser Leu Pro Val lie Gly 

355 360 365 

Arg Ala Thr Arg Thr Pro Aep Val lie Tyr Ala Phe Gly Hie Gly Hie 

i 

370 375 380 

Leu Gly Mot Thr Gly Ala Pro Met Thr Ala Thr Leu Val Ser Glu Leu 

385 390 395 400 

Leu Ala Gly Glu Lys Thr Ser lie Asp lie Ser Pro Phe Ala Pro Asn 
405 410 415 



Arg Phe Gly lie Gly Lye Ser Lys Gin Thr Gly Pro Ala Ser 
420 425 430 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH i 1296 base pairs 

( B ) TYPE t nucleic acid 

(C) STRANDEDNESS i double 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE : DNA (recombinant) 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO:6: O'ZOS^ 
ATCCCTGAGA ACCACAAAAA ACTAGGCATC GCTGGACCCG GAATCGTCCG CGTATGCACG 60 

CCGCTGATGC TTCAGCGCCG CGGATTCAAA GTCACCTTGA TTGACCCGAA CCCTCCTGCC 120 

GAAGGTGCAT CGTTTGGGAA TGCCCCATGC TTCAACGGCT CATCCGTCGT CCCTATGTCC 180 

ATGCCGGGAA ACTTCACGAG CGTCCCGAAO TCGCTCCTTG ACCCGATGGC GCCGTTCTCA 240 

ATCCGGTTCA GCT A TTTTCC AACCATCATG CCCTGGTTGA TTCGCTTTCT GTTACCCCGA 300 

AGACCAAACA AGGTGAAGGA GCAGGCGAAA GCACTCCGCA ATCTCATCAA GTCCACGGTG 3 60 

CCTCTGATCA AGTCATTGCC GCAGGAGGCT GATCCGAGCC ATCTGATCCO CCATGAAGGT 420 

CATCTGACCG TATATCGTGG AG AAG CAGAC TTCGCCAAGG ACCGCGGAGG TTGGGAACTG 4 B0 

CGGCGTCTCA ACGGTGTTCG CACGCAGATC CTCAGCGCCG ATGCGTTGCG GCATTTCGAT 540 

CCGAACTTGT CGCATGCGTT TACCAAGGGC ATTCTTATAG AAGAGAACGG TCACACGATT 600 

AATCCGCAAC GGCTCCTCAC CCTCTTGTTT CGCCCTTTTA TCGCGAACGG TGGCGAATTT 660 

GTATCTCCCC GTCTCATCGG CTTTGACACT GAAGGTAGGG CGCTTAAAGO CATTACAACC 720 

ACGAACGCCG T T CTGCCCGT TGATGCAGCG GTTGTCGCAG CCCGCGCACA CTCGAAATCA 780 

CTTGCTAATT CGCTAGGCGA TGACATCCCG CTCGATACCG AACGTGGATA TCATATCGTC 840 

ATCCCGAATC CGGAAGCCGC TCCACGCATT CCGACGACCG ATGCGTCAGG AAAATTCATC 900 

CCGACACCTA TGGAAATCGG GCTTCGCGTG GCGGGTACGG TTGAGTTCCC TGGGCTCACA 960 

GCCGCTCCTA ACTGGAAACG TGCGCATGTG CTCTATACGC ACGCTCGAAA ACTTCTTCCA 1020 

GCCCTCCCGC CTGCGAGTTC TGAAGAACGA TATTCCAAAT GGATCGCGTT CCGGCCGACC 1080 

ATCCCGGATT CGCTCCCCCT GATTGGCCGG GCAACCCGCA CACCCGACGT AATCTATGCT 1140 

TTCCGCCACG GTCATCTCGG CATGACACCG GCGCCGATGA CCGCAACGCT CGTCTCAGAG 1200 

CTCCTCGCAG GCGAAAACAC CTCAATCGAC ATTTCGCCCT TCGCACCAAA CCGCTTTGGT 1260 

ATTGGCAAAT CCAACCAAAC GGCTCCGGCA AGTTAA 1296 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 base pairs 

( B ) TYPE i nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (recombinant) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: Q'?os _! %4 

ATGGCTGAGA ACCACAAAAA AGTAGGCATC GCTCGACCTG GAATCGTTGG TCTATGCACT 60 

CCTTTGATCC TTCAACGTCG TCGATTCAAA GTCACCTTGA TTGACCCCAA CCCTCCTCGC 120 

GAAGGTCCAT CGTTTGCCAA TCCCGCATGC TTCAACCGCT CATCCGTCCT CCCTATGTCC 180 

ATGCCGGGAA ACTTGACGAG CGTCCCGAAG TGGCTCCTTG ACCCGATGGG GCCGTTGTCA 240 

ATCCGGTTCA GCTATTTTCC AACCATCATG CCCTGGTTGA TTCGCTTTCT CTTAGCCGGA 300 

AGACCAAACA AGCTGAAGGA CCAGCCGAAA CCACTCCGCA ATCTCATCAA GTCCACGGTG 360 

CCTCTGATCA AGTCATTGGC GCAGGAGGCT GATGCGAGCC ATCTGATCCG CCATG AAGG T 420 

CAT CTG ACCG TATATCGTCG AGAACCAGAC TTCGCCAAGG ACCGCGGAGG TTGCGAACTG 480 

CGCCGTCTCA ACGGTGTTCG CACGCAGATC CTCTCTGCTG ATGCTTTGCG TGATTTCGAT 540 

CCTAACTTGT CGCATCCTTT TACCAAGGCC ATTCTTATAC AAGAGAACCG TCACACGATT 600 

AATCCG CAAG GCCTCGTGAC CCTCTTCTTT CGGCGTTTTA TCGCGAACGG TGGCGAATTT 660 

GTATCTGCGC GTGTCATCGG TTTTGAGACT GAAGGTCGTG CTCTCAAAGG CATTACAACC 720 

ACTAACCCTG TTCTGGCTGT TGATGCAGCT GTTGTTGCAG CTGGTGCACA CTCTAAATCA 780 

CTTGCTAATT CCCTACCCGA TGACATCCCG CTCGATACCG AACGTGCATA TCATATCGTC '840 

ATCGCGAATC CGGAAGCCGC TCCACGCATT CCGACGACCG ATGCGTCAGG AAAATTCATC 900 

GCGACACCTA TGGAAATGGG TCTTCGTGTT GCTCGTACTG TTCAGTTTGC TGGTCTCACA 960 

GCTGCTCCTA ACTGGAAACG TGCGCATGTG CTCTATACGC ACCCTCCAAA ACTTCTTCCA 1020 

CCCCTCCCCC CTGCGAGTTC TCAAGAACGA TATTCCAAAT GGATGGGTTT TCGTCCTAGC 1080 

ATTCCTGATT CTCTTCCAGT GATTGGTCGT GCAACTCGTA CACCCCACGT AATCTATCCT 1140 

TTTGGTCACG GTCATCTCGG TATGACAGGT GCTCCAATGA CTGCAACTCT CGTCTCAGAG 1200 
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CTCCTCGCAG GCGAAAAGAC CTCAATCGAC ATTTCCCCCT TCGCACCAAA CCGCTTTCGT 3.260 

ATTCGCAAAT CCAAGCAAAC GCCTCCGCCA AGTTAA 1296 
(2) INFORMATION FOR SEQ ID NO* 8 s 

<i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH I 1296 baae pairs 

(B) TYPEt nucleic acid 

(C) STRAND EDNESS t double 

(D) TOPOLOGYt linear 

(ii) MOLECULE TYPEt DNA (synthetic) 

(xi> SEQUENCE DESCRIPTION: SEQ ID NOtS: Q20B35 

ATGC CTG AG A ACCACAAGAA GGTTGGTATC GCTGGAGCTG GAATCGTTGG T C TTTGCACT 60 

CCTTTGATGC TTCAACGTCG TGGATTCAAG GTTACCTTGA TTGATCCAAA CCCACCACGT 120 

GAAGGTGCTT CTTTCCGTAA CGCTCGTTGC TTCAACCCTT CCTCCGTTCT TCCAATGTCC 180 

ATC CCAGGAA ACTTGACTAG CGTTCCAAAG TGGCTTCTTG ACCCAATGGG TCCATTGTCC 240 

ATCCGTTTCA GCTACTTTCC AACCATCATG CCTTGGTTGA TTCGTTTCTT CCTTGCTGCA 300 

AGACCAAACA AGGTGAAGGA GCAAGCTAAG GCACTCCCTA ACCTCATCAA GTCCACTGTG 360 

CCTTTCATCA AGTCCTTGGC TGAGGAGGCT GATGCTAGCC ACCTTATCCG TCACGAAGGT 420 

CACCTTACCG TGTACCGTGG AGAAGCAGAC TTCGCCAAGG ACCGTGGAGG TTCCGAACTT 480 

CCTCGTCTCA ACGGTGTTCG TACTCAAATC CTCAGCGCTG ATGCATTGCG TGATTTCGAT 540 

CCTAACTTGT CTCACGCCTT TACCAAGGGA ATCCTTATCG AAGAGAACGG TCACACCATC 600 

AACCCACAAG GTCTCGTGAC TCTCTTGTTT CGTCGTTTCA TCCCTAACGG TCGAGAGTTC 660 

GTCTCTGCTC GTGTTATCGG ATTCGACACT GAAGGTCCTG CTCTCAAGGG TATCACCACC 720 

ACCAACGGTG TTCTTGCTGT TGATGCAGCT GTTGTTGCAG CTGGTGCACA CTCCAAGTCT 780 

CTTCCTAACT CCCTTGGTGA TCACATCCCA TTCGATACCC AACCTGCATA CCACATCGTG 840 

ATCGCCAACC CAGAAGCTGC TCCACGTATT CCAACTACCG ATGCTTCTGG AAAGTTCATC 900 

GCTACTCCTA TCGAGATGGG TCTTCGTGTT GCTCGAACCG TTGAGTTCGC TCCTCTCACT 960 

GCTGCTCCTA ACTGCAAGCG TCCTCACGTT CTCTACACTC ACCCTCCTAA GTTGCTTCCA 1020 

CCTCTCGCTC CTGCCAGTTC TCAAGAACGT TACTCCAAGT GGATGGGTTT CCGTCCAAGC 1080 
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ATCCCACATT CCCTTCCAOT GATTGCTCCT CCTACCCGTA CTCCACACGT TATCTACCCT 1140 

TTCCGTCACG GTCACCTCGG TATGACTGGT CCTCCAATGA CCCCAACCCT CGTTTCTGAO 1200 

CTCCTCGCAG CTGAGAACAC CTCTATCGAC ATCTCTCCAT TCGCACCAAA CCGTTTCCGT 1260 

ATTGGTAAGT CCAAGCAAAC TGCTCCTGCA TCCTAA 1296 
(2) INFORMATION FOR SEQ ID NOt9t 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 279 base pair a 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s double 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: DNA (recombinant) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOl9t Q20836 

AGATCTCCAC AATGGCTTCC TCTATGCTCT CTTCCGCTAC TATGGTTGCC TCTCCGGCTC 60 

AGGCCACTAI GGTCGCTCCT TTCAACGCAC TTAAGTCCTC CGCTGCCTTC CCAGCCACCC 120 

GCAAGGCTAA CAACGACATT ACTTCCATCA CAAGCAACGG CGGAAGAGTT AACTGCATGC 180 

AGGTGTGGCC TCCGATTGGA AAGAAGAAGT TTGAGACTCT CTCTTACCTT CCTGACCTTA 240 

CCGATTCOGO TCCTCGCCTC AACTGCATGC AGGCCATGG 279 

(2) INFORMATION FOR SEQ ID NOtlO: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 318 base pairs 

(B) TYFEt nucleic acid 

(C) STRAND EDNESS s .double 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPSs DNA (recombinant) 

Q20S37 . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOtlO* 
AGATCTATCG ATAACCTTGA TGTAATTGGA GGAAGATCAA AATTTTCAAT CCCCATTCTT 60 
CGATTCCTTC AATTGAAGTT TCTCCGATGG CGCAAGTTAG CAGAATCTGC AATCGTGTGC 120 
AGAACCCATC TCTTATCTCC AATCTCTCGA AATCCAGTCA ACGCAAATCT CCCTTATCCG 180 
TTTCTCTGAA GACGCAGCA CATCCACGAG CTTATCCGAT TTCGTCGTCG TCGGGATTGA 240 
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AGAACAGTGG GATCACGTTA ATTCGCTCTG AGCTTCGTCC TCTTAAGCTC ATCTCTTCTG 



TTTCCACGGC GTCCATCC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH x 119 base pairs 

(B) TYPBs nuclaic acid 

(C) STRANDED NESS : doubla 

(D) TOPOLOGYs linaar 

(ii) MOLECULE TYPE! DNA (ganoaic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 Ux " - 
NCATGGACGT CTGATCGAAA TCGTCCTTAC CGCAGCAACG TAAGCCACGC CGAATTTTAT 
CACCTACCGC GAAACGGTGG CTAGGCAGCG AGAGACTGTC GCCTCCGCGG GAGCATCCT 
(2) INFORMATION FOR SEQ ID NO 1 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 basa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDSDNSSS s dbubl* 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: DNA (ganotnic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12x Q20B3<? 
GTACTTACGC GGTCCTGAGT ACAGCGCAGA CCCCGTCTCA AGATCAATCT CCACCTCGCA 
ATCACCTCGO AGACGCGAAA TGGCGCAAAT AGAACACATA TTAACG AGTC ACGCCCCGAA 
GCCTTTGGGT CACTACAGTC AGGCGCCCCG AGCGGGTGCA TTCATTCATG TTTCCGCTCA 
GCTTCCGATC AAACCAGAAG CCCAGTCGCA GCAATCTGAC. CATCTCGTCG .ATAACCAGGC 
CACTCTCCTT CTCCGGAATT TGCTGGCCGT ACTCGAO 
(2) INFORMATION TOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 baaa> pairs 

(B) TYPE: nuclaic acid 

(C) STRANDSDNESS: doubla 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE I DNA (tynthetic) 
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(Xi) SEQUENCE DESCRIPTIONS SEQ ID NOsl3s 
CAGACACTGT CGACTCCCCO CCAGCATCAT ATG 
(2) INFORMATION FOR SEQ ID NO I 14 t 

(i) SEQUENCE CHARACTERISTICS l 

(A) LENGTHS 35 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO » 14: 
GAACGAATCC AACCTTCTCA CGACCGCGTA AGTAC 
(2) INFORMATION FOR SEQ ID NO s 15 1 

<i) SEQUENCE CHARACTERISTICS s 

(A) LfcNGTHs 24 base pair* 

(B) TYPES nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGYs linear 

(ii) MOLECULE TYPE: DNA (synthstic) 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 15s 
CCCGAGATGA CCGTGGCCCA AACC 
(2) INFORMATION FOR SEQ ID NO J 16 1 

(1) SEQUENCE CHARACTERISTICS s . 

(A) LENGTHS 24 base pairs 

(B) TYFSs nucleic acid 

(C) STRANDEDNESSs sing la 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPEs DNA (synthetic) 



(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 16s 
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GCGAATGCCG GATGCTTCAA CCGC . 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : .double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (recombinant) 
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24 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: l..i296 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG GCT GAG AAC CAC AAG AAG GTT GGT ATC GCT GGA GCT GGA ATC GTT 
Met Ala Glu Aen Hie Lye Lye Val Gly He Ala Gly Ala Cly He Val 



10 



15 



4B 



GCT GTT TGC ACT GCT TTG ATG CTT CAA CGT CGT GGA TTC AAG GTT ACC 
Gly Val Cye Thr Ala Leu Met Leu Gin Arg Arg Gly Phe Lye Val Thr 



20 



25 



30 



96 



TTG ATT GAT CCA AAC CCA CCA GGT CAA GCT CCC TCT TTC GGT AAC OCT 
Leu He Asp Pro A»n Pro Pro Gly Glu Gly Ala Ser Phe Gly A»n Ala 



35 



40 



45 



144 



CGT TGC TTC AAC GGT TCC ICC CTT GTT CCA ATG TCC ATG CCA GGA AAC 
Gly Cye Phe Asn Gly Ser Ser Val Val Pro Met Ser Met Pro Gly Aen 



50 



55 



60 



192 



TTG ACT ACC GTT CCA AAG TOG CTT CTT GAC CCA ATG CGT CCA TTG TCC 
Leu Thr Ser Val Pro Lye Trp Leu Leu Asp Pro Met Gly Pro Leu Ser 



65 



70 



75 



80 



240 



ATC CGT TTC GGC TAC TTT CCA ACC ATC ATG CCT TGG TTG ATT CGT TTC 
He Arg Phe Gly Tyr Phe Prp Thr He Met Pro Trp Leu He Arg Phe 



85 



90 



95 



288 
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TTG CTT GCT GCA AGA CCA AAC AAG GTG AAG GAG CAA GCT AAO CCA CTC 
Leu Lou Ala Gly Arg Pro Aon Lyo Val Ly» Glu Gin Ala Lyo Ala Lou 



100 



105 



110 



336 



COT AAC CTC ATC AAG TCC ACT GTG CCT TTG ATC AAG TCC TTG GCT GAG 
Arg Aon Lou lie Ly» Sor Thr Val Pro Lou Ilo Lyo Sor Lou Ala Glu 



115 



120 



125 



384 



GAG GCT GAT GCT ACC CAC CTT ATC COT CAC GAA GGT CAC CTT ACC GTG 
Glu Ala Asp Ala Sor Hi» Lou Ilo Arg Hio Glu Gly. Hie Lou Thr Val 



130 



135 



140 



432 



TAC CGT GGA GAA GCA GAC TTC GCC AGO CAC CGT GCA GGT TCC CAA CTT 
Tyr Arg Gly Glu Ala Aop Pho AJ.a Arg Aop Arg Gly Oly Trp Glu Lou 



145 



150 



155 



160 



400 



CGT CGT CTC AAC GGT CTT CGT ACT CAA ATC CTC AGC CCT GAT GCA TTG 
Arg Arg Lou Aon Gly Val Arg Thr Gin Ilo Lou Sor Ala Asp Ala Lou 



165 



170 



175 



528 



CGT GAT TTC CAT CCT AAC TTG TCT CAC GCC TTT ACC AAG GGA ATC CTT 
Arg Aop Pho A»p Pro Aon Leu Sor Hio Ala Pho Thr Lyo Gly lie Lou 

ISO IBS 190 



576 



ATC CAA GAG AAC CCT CAC ACC ATC AAC CCA CAA GGT CTC CTC ACT CTC 
Ilo Glu Glu Aon Gly Hio Thr Ilo Aon Pro Gin Gly Leu Val Thr Lou 



195 



200 



205 



624 



TTG TTT CGT CGT TTC ATC GCT AAC GGT GGA GAG TTC GTG TCT GCT CGT 
Lou Pho Arg Arg Pho Ilo Ala Aon Gly Gly Glu Pho Val Sor Ala Arg 



210 



215 



220 



672 



GTT ATC GGA TTC GAG ACT GAA GGT CGT CCT CTC AAG GGT ATC ACC ACC 
val Ilo Gly Pho Glu Thr Glu Gly Arg Ala Leu Lyo Gly Ilo Thr Thr 



225 



230 



235 



240 



720 
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ACC AAC CGT GTT CTT CCT CTT CAT CCA GCT CTT CTT CCA OCT OCT CCA 
Thr Aan Gly Val Lau Ala Val Asp Ala Ala Val Val Ala Ala Gly Ala 



245 



250 



255 



768 



CAC TCC AAG TCT CTT CCT AAC TCC CTT GGT CAT GAC ATC CCA TTG CAT 
Hla Sor Lya Sar Lau Ala Aan Sar Lau Gly Aap Aap Zla Pro Lau Aap 



260 



265 



270 



816 



ACC GAA CGT CCA TAC CAC ATC GTG ATC GCC AAC CCA CAA CCT CCT CCA 
Thr Clu Arg Gly Tyr Hia I la Val I la Ala Aan Pro Glu Ala Ala Pro 



275 



260 



285 



864 



CCT ATT CCA ACT ACC CAT GCT TCT CGA AAC TTC ATC CCT ACT CCT ATG 
Arg Ila Pro Thr Thr Aap Ala Sar Oly Lya Pha Ila Ala Thr Pro Mat 



290 



295 



300 



912 



GAG ATG GGT CTT CGT GTT CCT GCA ACC GTT GAG TTC CCT GGT CTC ACT 
Glu Mat Gly Lau Arg Val Ala Gly Thr Val Glu Pha Ala Gly Lau Thr 



305 



310 



315 



320 



960 



GCT CCT CCT AAC TGC AAG CGT GCT CAC GTT CTC TAC ACT CGC GCT CGT 
Ala Ala Pro Aan Trp Lya Arg Ala Hia Val Lau Tyr Thr Arg Ala Arg 



325 



330 



335 



1008 



AAG TTG CTT CCA GCT CTC GCT CCT GCC ACT TCT CAA GAA CGT TAC TCC 
Lya Lau Lau Pro Ala Lau Ala Pro Ala Sar Sar Glu Glu Arg Tyr Sar 



340 



345 



350 



105.6 



AAG TGO ATG GGT TTC OCT CCA AGC ATC CCO GAT TCC CTT CCA GTG ATT 
Lya Trp Mat Oly Pha> Arg Pro Ser Ila Pro Aap Sar Lau Pro Val Ila> 



355 



360 



365 



1104 



GGT CGT GCT ACC COT ACT CCA CAC GTT ATC TAC GCT TTC CCT CAC GCT 
Gly Arg Ala Thr Arg Thr Pro Aap val Ila Tyr Ala Pha Gly Hla Gly 



370 



375 



380 



1152 
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CAC CTC OCT ATG ACT CCT OCT CCA ATG ACC CCA ACC CTC OTP TCT GAG 
His Lou Gly Met Thr Gly Ala Pro Met Thr Ala Thr Leu Val Ser Olu 



38S 



390 



395 



400 



1200 



CTC CTC GCA CGT GAG AAO ACC TCT ATC GAC ATC TCT CCA TTC CCA CCA 
Leu Leu Ala Gly Clu Lye Thr Ser !!• Aep lie Ser Pro Phe Ala Pro 



405 



410 



415 



1248 



AAC CGT TTC OCT ATT OCT AAG TCC AAG CAA ACT CGT CCT GCA TCC TAA 
Aan Arg Phe Gly lie Gly Lye Ser Lye Gin Thr Gly Pro Ala Ser 
420 425 430 



1296 



(2) INFORMATION FOR SEQ ID NOsl8i 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTHS 431 amino acide 

(B) TYPES amino acid 
(0) TOPOLOGY: linear 

(11) MOLECULE TYPEt protein 

<xi) SEQUENCE DESCRIPTION s SEQ ID NO S 18 s 

Met Ala Olu Aen Hie Lye Lye Val Gly lie* Ala Gly Ala Gly He Val 

x 5 10 15 

Gly Val eye Thr Ala Leu Met Leu Oln Arg Arg Gly Phe Lye Val Thr 

20 25 30 

Leu 11m Aep Pro Aen Pro Pro Gly Olu Gly Ala Ser Phe Gly Aen Ala 

35 40 45 

Gly Cye Phe Aen Gly Ser Ser Val Val Pro Met Ser Mat Pro Gly Aen 

50 S5 60 



Leu Thr Ser Val Pro Lye Trp Leu Leu Aep Pro Met Gly Pro Leu Ser 
65 70 75 "0 
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Ilo Arg Pho Gly Tyr Pho Pro Thr lie Mot Pro Trp Leu lie Arg Phe 
B5 90 95 

Leu Leu Ala Oly Arg Pro A»n Ly» VaI Ly» Glu Gin Ala Ly» Ala Leu 

100 105 110 

Arg Aen Leu lie Ly» Ser Thr Val Pro Leu lie Lye Ser Leu Ala Glu 

115 120 125 

Glu Ala Aep Ala Ser Hie Leu lie Arg Hie Glu Gly Hie Leu Thr Val 
130 135 140 

Tyr Arg Gly Glu Ala Aep Phe Ala Arg Asp Arg Gly Gly Trp Glu Leu 
145 150 155 160 

Arg Arg Leu Aen Gly Val Arg Thr Gin lie Leu Ser Ala Aep Ala Leu 

165 170 175 

Arg A«p Phe Aep Pro Aen Leu Ser Hie Ala Phe Thr Lye Gly He Leu 

180 105 190 

lie Glu Glu Aen Gly Hie Thr Xle Aen Pro Gin Gly Leu Val Thr Leu 
195 200 205 

Leu Phe Arg Arg Phe Xle Ala Aen Gly Gly Glu Phe Val Ser Ala Arg 

210 215 220 

Val lie Gly Phe Glu Thr Glu Gly Arg Ala Leu Lye Gly Xle Thr Thr 
225 230 235 240 

Thr Aen Gly Val Leu Ala Val Aep Ala Ala Val Val Ala Ala Gly Ala 
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245 



250 



255 



H18 Ser Ly b Ser Leu Ala Aon Ser Leu Gly Aap Ab P He Pro Leu A«p 
260 * 265 270 

Thr Glu Arg Gly Tyr Hie lie Va; lie Ala Aen Pro Clu Ala Ala Pro 
275 280 285 

Arg He Pro Thr Thr Aep Ala Ser Gly Lye Phe He Ala Thr Pro Met 
290 2 9S 300 

Glu Met Gly Leu Arg Val Ala Gly Thr Val Glu Phe Ala Gly Leu Thr 
305 310 315 320 

Ala Ala Pro Aen Trp Lyo Arg Ala Hie Val Leu Tyr Thr Arg Ala Arg 
325 330 335 

Lys Leu Leu Pro Ala Leu Ala Pro Ala Ser Ser Glu Glu Arg Tyr Ser 
340 345 350 

Lye Trp Met Gly Phe Arg Pro Ser He Pro Aep Ser Leu Pro Val He 



355 



360 



365 



Gly Arg Ala Thr Arg Thr Pro A.p Val He Tyr Ala Phe Gly Hi. Gly 

• 370 375 , 380 

His Leu Gly Met Thr Oly Ala Pro Met Thr Ala Thr Leu Val Ser Glu 
385 390 395 400 

Leu Leu Ala Gly Clu Lyo Thr Ser He Aep He Ser Pro Phe Ala Pro 



405 



410 



415 



Asn Arg Ph* Gly He Gly Ly« Ser Ly« Gin Thr Gly Pro Ala Ser 
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<2) INFORMATION FOR SEQ ID NOt 19 1 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH i 29 base pair* 

(B) TYFEt nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY* linear 

(ii) MOLECULE TYPE* DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION t SEQ ID NOsl9* 

CCTTCTCTAC ACTCCTGCTC GTAACTTCC 
29 

(2) INFORMATION FOR SEQ ID NO* 20* 

(i) SEQUENCE CHARACTERISTICS s 
(A) LENGTH* 29 base pairs 
<B) TYPEs nucleic acid 

(C) STRANDEDNESS * single 

(D) TOPOLOGY* linaar 

(ii) MOLECULE TYPE* DNA (synthetic) 



(Xi) SEQUENCE DESCRIPTION* SEQ ID NO*20t 
CCTTCTCTAC ACTAAGGCTC CTAAGTTGC 
(2) INFORMATION FOR SEQ ID NOi21* 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 29 base pairs 

(B) TYPSt nucleic acid 

(C) STRANDEDNESS* single 

(D) TOPOLOGY* linear 

(ii) MOLECULE TYPE* DNA (synthetic) 



29 



(Xi) SEQUENCE DESCRIPTION* SEQ ID NO*21* 
CCTTCTCTAC ACTCAAGCTC CTAAGTTGC 
(2) INFORMATION FOR SEQ ID NO* 22: 



29 
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(1) SEQUENCE CHARACTERISTICS* 

(A) LENGTH J 29 baae pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS t single 
(0) TOPOLOGY s linear 

(ii) MOLECULE TYPEt DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION s SEQ ID NOs22: 
CCTTCTCTAC ACTCCTCCTC CTAACTTCC 
(2) INFORMATION FOR SEQ ID NOt23i 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 32 base pairs 

(B) TYPEs nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPES DNA (synthetic) 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs23s 
CTCTACACTT GGGCTCGTAA CCTTCTTCCA GC 
(2) INFORMATION FOR SEQ ID NO* 24: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 32 base pair* 

(B) TYPBs nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY s linear 

<ii) MOLECULE TYPES DNA (eynthetic) 



(xi) SEQUENCE DESCRIPTIONS SEQ. ID NOs24s 
CT CT ACACT A TCGCTCCTAA CCTTCTTCCA CC 
(2) INFORMATION FOR SEQ ID NOs25s 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 32 base paire 

(B) TYPEt nucleic acid 

(C) STRANDEDNSSSs single 

(D) TOPOLOGY s linear 
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(11) MOLECULE TYPE i DNA (synthetic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25i 
CTCTACACTC TGGCTCGTAA GCTTCTTCCA GC 
(2) INFORMATION FOR SEQ ID NOj26i 

(i) SEQUENCE CHARACTERISTICS J 

(A) LENGTH I 32 base pairs 

(B) TYPEs nucleic acid 

(C) STRAND EDNESS t single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE J DNA (synthetic) 

(Xi) SEQUENCE DESCRIPTION s SEQ ID NOi26: 
CTCTACACTG AAGCTCGTAA GCTTCTTCCA GC 
(2) INFORMATION FOR SEQ ID NOs27s 

(i) SEQUENCE CHARACTERISTICS S 

(A) LENGTHS 62 baee pairs 

(B) TYPEs nucleic acid 

(C) STRAKDEDNESSs • ingle 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE i DNA (synthetic) 



32 



32 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO i 27s 
CCCTGGAGCT GGAATCGTTG GTCTATGCAC TCCTTTGATG CTTCAACGTC GTGGATTCAA 
AG 

(2) INFORMATION FOR SEQ ID HO 1 28: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 65 ba»e pairs 

(B) TYPEs nucleic acid 

(C) STRAND EDNESS s single 

(D) TOPOLOGY s linear 



60 
62 



(ii) MOLECULE TYPEs DHA (synthetic) 



Ii 
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<xl) SEQUENCE DESCRIPTION: SEQ ID NOi28l 
CCAGATCCTC TCTCCTCATG CTTTCCGTGA TTTCCATCCT AACTTGTCTC ATCCTTTTAC 60 

65 

CAAGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 41 bass pairs 

(B) TYPES nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GTCATCGGTT TTGAGACTGA AGGTCGTCCT CTCAAAGGCA T 

(2) INFORMATION FOR SEQ ID NO: 30: 

(1) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 

(it) MOLECULE TYPEt DNA {synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

TACAACCACT AACGGTGTTC TGCCTCTTGA TGCAGCTGTT GTTCCAGCTG GTGCACACTC 
TAAATCACT 

(2) INFORMATION FOR SEQ ID NO: 31* 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY a linear 

(li) MOLECULE TYPE i DNA (synthetic) 



60 
69 



m 

•m 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGAAATGGGT CTTCOTGTTG CTGGTACTGT TGAGTTTGCT GGTCTCACAG CTGCTCCTAA 



60 
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(2) INFORMATION FOR SEQ ID NO 1 32 1 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH t 68 baa« paira 

(B) TYPE i nucleic acid 

(C) STRANDEDNESSt aingla 

(D) TOPOLOGYi linear 

(ii) MOLECULE TYPE* DNA (»ynthatic) 



(xi) SEQUENCE DESCRIPTION! SEQ ID NOs32s 
TCCATGCCTT TTCGTCCTAO CATTCCTCAT TCTCTTCCAG TGATTGCTCC TCCAACTCCT 
ACACCCGA 

(2) INFORMATION TOR SEQ ID NOi33r 

(i) SEQUENCE CHARACTERISTICS s 
<A> LENGTH* 69 baaa pairs 

(B) TYPSt nuclaic acid 

(C) STRANDEDNESSt aingla 

(D) TOPOLOGYx linear* 



60 
68 



(ii) MOLECULE TYPEs DNA (synthatic) 



(xi) SEQUENCE DESCRIPTION s SEQ ID NOx33s 
CCTAATCTAT GCTTTTGGTC ACGCTCATCT CCCTATCACA CCTGCTCCAA TGACTGCAAC 
TCTCGTCTC 



60 
69 
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Claims: 

1. An isolated double- stranded DNA molecule 
consisting essentially of DNA encoding a glyphosate 
oxidoreductase enzyme. 

2. A recombinant, double-stranded DNA 
molecule comprising in sequence: 

a) a promoter which functions in plants to cause 
the production of an UNA sequence; 

b) a structural DNA sequence that causes the 
production of an RNA sequence which encodes 
a glyphosate oxidoreductase enzyme; and 

c) a 3* non-translated region which functions in 
plants to cause the addition of polyadenylated 
nucleotides to the 3' end of the RNA sequence 

where the promoter is heterologous with respect to 
the structural DNA sequence and adapted to cause 
sufficient expression of said enzyme in plant tissue, 
including meristematic tissue, to enhance the 
glyphosate tolerance of a plant cell transformed with 
said gene. 

25 3. A DNA molecule of Claim 2 in which said 

structural DNA sequence encodes a fusion polypeptide 
comprising an amino- terminal chloroplast transit peptide and 
a glyphosate oxidoreductase enzyme. 

30 4. A DNA molecule of Claim 3 in which the 

promoter is a plant DNA virus promoter. 



15 



92041559 



^1992 DERWENT PIJBLUJATIUINS Liu 



WO 92/00377 



PCT/US91/04514 



- 119 - 

5. A DNA molecule of Claim 4 in which the 
promoter is selected from the group consisting of CaMV35S and 
FMV35S promoters. 

5 

6. A method of producing genetically 
transformed plants which are tolerant toward glyphosate 
herbicide, comprising the steps of: 

a) inserting into the genome of a plant cell a 
10 recombinant, double-stranded DNA molecule 

comprising: 
i) a promoter which functions in plant 
cells to cause the production of an RNA 
sequence* 

15 ii) a structural DNA sequence that causes 

the production of an RNA sequence 
which encodes a glyphosate oxido- 
reductase enzyme, 
iii) a 3* non-translated DNA sequence which 
20 functions in plant cells to cause the 

addition of polyadenylated nucleotides to 
the 3' end of the RNA sequence 
where the promoter is heterologous with respect to 
the structural DNA sequence and adapted to cause 
25 sufficient expression of said enzyme in plant tissue, 

including meristematic tissue, to enhance the 
glyphosate tolerance of a plant cell transformed with 
said gene; 

b) obtaining a transformed plant cell; and 
30 c) regenerating from the transformed plant cell a 

genetically transformed plant which has 
increased tolerance to glyphosate herbicide. 
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7. A method of Claim 6 in which said structural 
DNA sequence encodes a fusion polypeptide comprising an 
amino terminal chloroplast transit peptide and a glyphosate 

^ oxidoreductase enzyme. 

8. A method of Claim 7 in which the promoter is 
from a plant DNA virus. 

10 9 & method of Claim 8 in which the promoter is 

selected from the group consisting of CaMV35S and FMV35S 
promoters. 

10. A glyphosate tolerant plant cell comprising a 
15 DNA molecule of Claim 3. 

11. A glyphosate tolerant plant cell of Claim 10 in 
which the promoter is a plant DNA virus promoter. 

20 12. A glyphosate tolerant plant cell of Claim 11 in 

which the promoter is selected from the group consisting of 
CaMV35S and FMV35S promoters. 

13. A glyphosate tolerant plant cell of Claim 10 
25 selected from the group consisting of corn,, wheat, rice, 
soybean, cotton, sugarheet, oilseed rape, canola, flax, 
sunflower, potato, tobacco, tomato, alfalfa, lettuce, apple, poplar 
and pine. 

30 14. A glyphosate tolerant plant comprising plant 

cells of Claim 10. 

15. A glyphosate tolerant plant of Claim 14 in 
which the promoter is from a DNA plant virus promoter. 
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16. A glyphosate tolerant plant of Claim 15 in 
which the promoter is selected from the group consisting of 
CaMV35S and FMV35S promoters. 

17. A glyphosate tolerant plant of Claim 14 
selected from the group consisting of corn, wheat, rice, 
soybean, cotton, sugarbeet, oilseed rape, canola, flax, 
sunflower, potato, tobacco, tomato, alfalfa, lettuce, apple, poplar 
and pine. 



18. A method for selectively controlling weeds in 
a field containing a crop having planted crop seeds or plants 
comprising the steps of: 

a) planting said crop seeds or plants which are 
glyphosate tolerant as a result of a 
recombinant double-stranded DNA molecule 
being inserted into said crop seed or plant, 
said DNA molecule having: 
20 • i) a promoter which functions in plants to 

cause the production of an UNA 
sequence, 

ii) a structural DNA sequence that causes 
the production of an RNA sequence 

25 which encodes a glyphosate oxido- 

. reductase enzyme, 

iii) a 3' non-translated DNA sequence 
which functions in plants to cause the 
addition of polyadenylated nucleotides 

30 to the 3* end of the RNA sequence 

where the promoter is heterologous with 
respect to the structural DNA sequence and 
adapted to cause sufficient expression of said 
enzyme in plant tissue, including 
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meristematic tissue to enhance the 
glyphosate tolerance of a plant transformed 
with said gene; and 
b) applying to said crop and weeds in said field a 
sufficient amount of glyphosate herbicide to 
control said weeds without significantly 
affecting said crop. 

19. A method of Claim 18 in which said 
structural DNA sequence encodes an amino terminal 
chloroplast transit peptide and a glyphosate oxidoreductase 
enzyme. 

20. A method of Claim 19 in which the crop plant 
is selected from the group consisting of corn, wheat, rice, 
soybean, cotton, sugarbeet, oilseed rape, canola, flax, 
sunflower, potato, tobacco, tomato, lettuce, apple, poplar, pine 
and alfalfa. 

21. A DNA of Claim 1 which hybridizes to the 
DNA sequence of SEQ ID NO:3. 

22. A glyphosate oxidoreductase protein 
substantially free of other bacterial proteins comprising the 
amino acid sequence as set forth in SEQ ID NO:5. 

23. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
arginine. 

24. The glyphosate oxidoreductase protein of 
claim 22 wherein the 'amino acid residue at position 334 is 
lysine. 



920,41559 



w l992 DERWENT FUBL1CAT1UINS JL1U 



WOS2/00377 



PCT/US91/04514 



123 



25. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
glutamine. 

5 

26. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
alanine. 

10 27. The glyphosate oxidoreductase protein of 

claim 22 wherein the amino acid residue at position 334 is 
tryptophan. 

28. The glyphosate oxidoreductase protein of 
15 claim 22 wherein the amino acid residue at position 334 is 

isoleucine. 

29. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
leucine. 

30. The glyphosate oxidoreductase protein of 
claim 22 wherein the amino acid residue at position 334 is 
glutamic acid. 

31. A method for selecting transformed plant 

tissue comprising: 

introducing a gene encoding glyphosate 

oxidoreductase into plant tissue; 
30 placing said plant tissue on a plant growth 

media containing glyphosate; 

selecting plant tissue which exhibits growth 
on said glyphosate containing media. 
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32. The method of claim 31 further comprising 
the step of confirming the presence of said glyphosate 
oxidoreductase gene in said plant tissue by recallusing on 
glyphosate a segment of said plant tissue exhibiting growth on 
glyphosate containing media. 

33. A recombinant bacterium containing the 
DNA of claim 1. 

34. A glyphosate oxidoreductase enzyme 
catalyzing the oxidation of glyphosate to aminomethyl- 
phosphonate and glyoxylate. 
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Sspl (320840 . 

6358 TCA7CAMATATTTAGCAGCATTCCAGATTGGGTTCAA 

TCAACAAGGTACGAGCCATATCACTTTATTCAAAT7GG 

TATCGCCAAAACCMGAAGGAACTCCCATCCTCAAAGG 

TTTGTAAGGAAGAATTCTCAGTCCAAAGCCTCAACAAG 

GTCAGGGTACAGAGTCTCCAAACCATTAGCCAAAAGCT 

ACAGGAGATCAATGAAGAATCTTCAATCAAAGTAAACT 

AC7GTTCCAGCACATGCATCATGG7CAGTAAGTTTCAG 

AAAAAGACATCCACCGAAGACTTAAAG7TAGTGGGCAT 

CTTTGAAAGTAATCTTGTCAACATCGAGCAGCTGGCT7 

GTGGGGACCAGACAAAAAAGGAATGGTGCAGAATTGTT 

AGGCGCACCTACCAAAAGCATCTTTGCCTTTATTGCAA 

AAGATAMGCAGATTCCTCTAGTACAAGTGGGGAACAA 

AATAACGTGGAAAAGAGCTGTCCTGACAGCCCACTCAC 

TMTGCGTATGACGAACGCAGTGACGACCACAAAAGAA 

TTTTCCCTCTATATAAGAAGGCATTTCATJCCCATTTG 

AAGGATCATCAGATACTAACCAATATTTCTC 6954 

Sspl 
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1 NCATGGACGTCTGMCGAAATCG7CGTTACCGCAGCAAGGTAAGGCACGCCGAATTTTAT 
6 I CACCTACCGCGAAACGGTGGCTAGGCAGCGAGAGACTG7CGGCTCCGCGGGAGCATCCTA 

H 

121 TGTCTGAGAACCACAAAAAAGTAGGCATCGCTGGAGCCGGAATCGTCGGCGTATGCACGG 

SENHKKVG I AGAGI VGVCTA 
181 CGCTGATGCTTCAGCGCCGCG6ATTCAAAGTCACCTTGATTGACCCGAACCCTCCTGGCG 

LMLQRRGFKVTL 1 DPNPPGE 
241 AAGGTGCATCGTTTGGGAATGCCGGATGCTTCAACGGCTCATCCGTCGTCCCTATGTCCA 

GASFGNAGCFNGSSVVPH. SH 
301 TGCCGGGAAACTTGACGAGCGTGCCGAAGTGGCTCCTTGACCCGATGGGGCCGTTGTCAA 

PGNLTSVPKULLDPHGPLS1 
361 TCCGGTTCAGCTATTTTCCAACCATCATGCCC7GGTTGATTCGCTTTCTGTTAGCCGGAA 

rfsyfpti.mpwlirflla.gr 
421 gaccaaacaaggtgaaggagcaggcgaaagcactccgcaatctcatcaagtccacggtgc 
pnkvkeoak alrnlikstvp 

481 CTCTGATCAAGTCATTGGCGGAGGAGGCTGATGCGAGCCATCTGATCCGCCATGAAGGTC 
LIKSLAEEADASHLIRHEGH 

541 ATCTGACCGTATATCGTGGAGAAGCAGAlTrCGCCAAGGACCGCGGAGGTTGGGAACTGC 
LTVYRGEADFAKDRGGWELR 

601 GGCGTCTCAACGGTGmGCACGCAGATCCTCAGCGCCGATGCGTTGCGGGATTTCGATC 
RLNGVRTQILSADALRDFDP 

661 CGAACTTGTCGCATGCGmACCAAGG^ 

NLSH AFTKGILIEEMGH T^I ^ N 

721 ATCCGCAAGGGCTCGTGACCCTCTTGTnCGGCGTTTTATC^ 

781 TATCTGCGCGTGTCATCGGCTTTG^GACTGAAGGTAGGG^^ 

841 CGMcLgVcTG^ 

N G V L A V D A A V V A A G A^S K 
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901 TTGCTAATTCGCTAGGCGATGACATCCCGCTCGATACCGAACGTGGATATCATATCGTCA 

ANSLGDD I PLDTERGYHI V 1 
96 1 TCGCGAATCCGGAAGCCGCTCCACGCATTCCGACGACCGATGCGTCAGGAAAATTCATCG 

ANPEAAPR1PTTDASGKFIA 
1021 CGACACCTATGGAAATGGGGCTTCGCGTGGCGGGTACGGTTGAGTTCGCTGGGCTCACAG 

TPHEHGLRVAGTYEFAGLTA 
1 081 CCGCTCC7AACTGGAAACGTGCGCATGTGCTCTATACGCACGCTCGAAAACTTCTTCCAG 

APNVKRAHVLYTHARKLLPA 
114 1 CCCTCGCGCCTGCGAGTTCTGAAGAACGATATTCCAAATGGATGGGGTTCCGGCCGAGCA 

LAPASSEERYSKUMGFRPSI 
1201 TCCCG6ATTCGCTCCCCGTGATTGGCCGGGCAACCCGGACACCCGACGTAATCTATGC7T 

PDSLPV 1 GRATRTPDV 1YAF 

Ncol Socl 
1261 TCGGCCATGGTCATCTCGGCATGACAGGGGCGCCGATGACCGCAACGCTCGTCTCAGAGC 

GHGHLGMTGAPMTATLVSEL 
1321 TCCTCGCAGGCGAAAAGACCTCAATC6ACATTTCGCCCTTCGCACCAAACC6CTTT6GTA 
LAGEKTS IDISPFAPNRFGI 

Seal 

1381 TTGGCAAATCCAAGCAAACGGGtCCGGCAAGTTAAGTACTTACGCGGTCGTGAGTACAGC 
GKSKQTGP AS ■« 

G^GAGMGGTGGATTWTTC^ 
Xhol 

1681 GCCGTACTC6AG 

FIG.2B 
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fHe-t 

1 AGATCTCCATGGCTGAGAACCACAAAAAAGTAGGCATCGCTGGAGCCGGA 50 
— T 

51 ATCGTCGGCGTATGCACGGCGCTGATGCTTCAGCGCCGCGGATTCAAAGT 100 
T T T TT ATT 

101 CACCTTGATTGACCCGAACCCTCCTGGCGAAGGTGCATCGTTTGGGAATG 150 

151 CCGGATGCTTCAACGGCTCATicCGTCGTCCCTATGTCCATGCCGGGAAAC 200 

201 nGACGAGCGTGCCGMGTKCTCCTTGACCCGATGGGGCCGTTGTCAAT 250 

251 CCGGnCAGCTATnTCCAACCATCATGCCCTGGTTGATTCGCTTTCTGT 300 

301 TAGCCGGAAGACCAAACAAGGTGAAGGAGCAGGCGAAAGCACTCCGCAAT 350 

351 CTCATCAAGTCCACGGTGCCTCTGATCAAGTCATTGGCGGAGGAGGCTGA 400 

401 TGCGAGCCATCTGATCCGCCATGAAGGTCATCTGACCGTATATCGTGGAG 455 

451 AAGCAGACTTCGCCAAGGACCGCGGAGGTTGGCAACTGCGGCGTCTCAAC 500 

501 GGTGTTCGCACGCAGATCCTCAGCGCCGATGC6TTGCGGGATTTCGATCC 550 

TCT T T T 

551 GAACTTGTCGCATGCGTTTACCAA6GGCATTCTTATAGAAGAGAACGGTC 600 
T T 

92041559 
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601 ACACGATTAATCCGCAAGGGCTCGTGACCCTCTTGTTTC6GCGTTTTATC 650 

651 GCGMCGGTGGCGAATTTGTATCTGCGCGTG1CATCGGCTTTGAGACTGA 700 

T 

701 AGGTAGGGCGCTTAAAGGCATTACAACCACGAACGGCGTTCTGGCCGTTG 750 
C T T C T T T 

751 ATGCAGCGGTTGTCGCAGCCGGCGCACACTCGAAATCACTTGCTAATTCG BOO 
T T T; T T 

801 CTAGGCGATGACATCCCGCTCGATACCGAACGTGGATATCATATCGTCA7 850 

851 CGCGAATrcGGAAGCCGCTCCACGCATTCCGACGACCGATGCGTCAGGAA 900 

901 AAnCATCG^CACCTATGGAAATGGGGCTTCGCGTGGCGGGTACGGTT 950 

T T T T T 

951 GAGTTCGCTGGGCTCACAGCCGCTCX7AACTGGAAACGTGCGCATGTGCT 1000 
T T T 

1001 CTATACGCACGCTCGAAMCTTCnCCAGCCCTCGCGCCTGCGAGTTCTG 1050 

1051 AAGAACGATAnCCAAATGGATGGGGTT^GCC6AGCATCCCGGATTCG 1100 

U 01 CTCC(XGTGATTGGCCGGGCAACCCGGACACCCGACGTAATCTATGCTTT 1150 
T A T T T 1 

U51 CGGCCACGGTCATCTCGGCATGACAGGGGCGCCGATGACCGCAACGCTCG 1200 
XT T T T A T T 

1201 TCTCAGAGCTCCTCGCAGGCGAAMGACCTCAATCGACATTTCGCCCTTC 1250 
1251 GCACCAAACCGCTTTGGTATTGGCAAATCCAAGCAAACGGGTCCGGCAAG 1300 

1301 TTAAGTGGGAATTCAAGCTTG 1321 
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I AGATCTCCATGGCTGAGAACCACAAAAAAGTAGGCATCGCTGGAGCCGGA 50 

G G T T T 

51 ATCGTCGGCGTATGCACGGCGCTGATGCTTCAGCGCCGCGGATTCAAAGT 100 
TTT Tn ATT G 

1 0 I CACCTTGATTGACCCGAACCCTCCTGGCGAAGGTGCATCGTTTGGGAATG 150 
T TA AAT TTCTC 

151 CCGGATGCTTCAACGGCTCATCCGTCGTCCCTATGTCCATGCCGGGAAAC 200 
TT TC TTA A 

201 nGACGAGCGTGCC6AAGTGGCTCCTTGACCC6ATGGGGCCGTTGTCAAT 250 
T TAT A T A C 

251 CCGGnCAGCTATTTTCCAACCATCATGCCCTGGTTGATTEGCTrTCTGT 300 
T C T T CT C 

301 TAGCCGGAAGACCAAACAAGGTGAAGGAGCAGGCGAAAGCACTCCGCAAT 350 
j T A T G T C 

351 CTCATCAAGTCCACGGTGCCTCTGATCAAGTCATTGGCGGAGGAGGCT6A 400 
T T C T 

401 TGCGAGCCATCTGATCCGCCATGAAGGTCATCTGACCGTATATCGTGGAG 450 
T CT TC CT GC 

451 AAGCAGACTTCGCCAAGGACCGCGGAGGTTGGGAACTGCGGCGTCTCAAC 500 

501 GGTGTTCGCAC6CA6ATCCTCAGCGCCGATGCGTTGCGGGATTTCGATCC 550 

T T A T A T 

551 GAACTTGTCGCATGCGTTTACCAAGGGCATTCTTATAGAAGAGAACGGTC 600 
T T C C A C C 
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601 ACACGATTAATCCGCAAG6GCTCGTGACCCTCTTGTTTCGGCGTTTTATC 650 
C C C A T T T C 

651 GCGAACGGTGGCGAATTTGTATCTGCGC6TGTCATCGGCTTTGAGACTGA 700 
T A G C G T T AC 

701 AGGTAGGGCGCTTAAAGGCATTACAACCACGAACGGCGTTCTGGCCGTTG 750 
CTTCGTCC C T TT 

751 ATGCAGCGGTTGTCGCAGCCGGCGCACACTCGAAATCACTTGCTAATTCG 800 
T T TT CGT CC 

801 CTA6GCGATGACATCCCGCTCGATACCGAACGTGGATATCATATCGTCAT 850 
T T AT G C C G 

851 CGCGMTCCGGAAGCCGCTCCACGCATTCCGACGACCGATGCGTCAGGAA 900 
CCA T T AT TT 

901 MnCATCGC^CACCTATGGAAATGGGGCTTCGCGTGGCGGGTACGGTT 950 
G TT G T TTTAC 

951 GAGTTCGCTGGGCTCACAGCCGCTCCTAACTGGAAACGTGCGCATGTGCT 1000 
T T T G T C T 

1001 CTATACGCACGCTCGAAAACTTCTTCCAGCCCTCGCGCCTGCGAGTTCTG 1050 
C T T GT G T T C 

1051 AAGAACGATATTCCAAATGGATGGGGTTCCGGCCGAGCATCCCGGATTCG 1100 
TCG TTA AC 

1101 CTCCCCGTGATTGGCCGGGCAACCCGGACACCCGACGTAATCTATGCTTT 1150 
TA T T T TTA T C 

1151 CGGCCACGGTCATCTCG6CATGACAGGGGCGCCGATGACCGCAACGCTCG 1200 
f c T TTTA C 

1201 TCTCAGAGCTCCTCGCAGGCGAAAAGACCTCAATCGACATTTCGCCCTTC 1250 
j7 TG T CTA 

1251 GCACCAAACCGCTTTGGTATTGGCAAATCCAAGCAAACGGGTCCGGCAAG 1300 
T C T G T T TC 

1301 TTAAGTGGGAATTCAAGCTTG 1321 
C 
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B 

0 
I 
I 
I 

AGATCTCCACMTGGCTTCCTCTATGCTCTCTTCCGCTACTATGGTTGCCTC7CCGGCTC 
1 , 1 1 1 1 + 60 

TCTAGAG(H(rnACCGMGGAGATAC6AGAGAAGGCGATGATACCAACGGAGAGGCCGAG 

ci Me-MlaSerSerHeUeuSerSerAlaThrHetYalAlaSerProAlaGln - 

AGGCCACTATGGTCGCTCCTTTCAACGWCTTAAGTCCTCCGCTGCCTTCCCAGCCACCC 

61 1 1 1 1 1 + 120 

TCCGOTGATACCAGCGAGGAAAGTTGCCTGAATTCAGGAGGC6ACGGAAGGGTCGG7GGG 

c» AlaThrHe-tValAlftProPheAsnGlyLeuLysSerSerAlaAlaPheProAlaThrAro - 

GCAAGGCTAACAACGACAnACTTtXATWCMGCAACGGCGGAAGAGTTAACTGCATGC 

121 1 1 — I 1 • * 180 

CGTrCCGATTGTTGCTGTMTGMGGTAGTGTTCGnGCCGCCTTCTCAATTGACGTACG 

ci LysAlaAsnAsnAspIleThrSerlleThrSerAsnGlyGlyAroValAsnCysHe-tGIn - 

AGGTGTGGCCTCCGATTGGAAAGAAGAAGTTTGAGACTCTCTCTTACCTTCCTGACCTrA 

j 8l 1 -» — * 1 1 1 — ♦ 240 

TCCACACCGGAGGCTAACCTnETTCTTCAAACTCTGAGAGAGAATGGAAGGACTGGAAT 

ci ValTrpProProIleGlyLysLysLysPheGluThrLeuSerTyrLeuProAspLeuThp - 

N 



1 

CCGATTCCGGTGGTCGCGTCAACTGCATGCAGGCCATGG 

24l ■— h h— 1 279 

GGCTAAGGCCACCAGCGCAGTTGACGTACG7CCGGTACC 

AspSerGlyGlyArgYalAsnCysMe-tGlnAtaHe-t 
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B 

g 
i 
i 
I 

AGATCTATCGATAAGCTTGATGTAATTGGAGGAAGATCAAAATTTTCAATCCCCATTCTT 

1 1 1 1 1 1- + 60 

TCTAGATAGCTATTCGAACTACATTAACCTCCT7CTAGTTTTAAAAGTTAGGGGTAAGAA 

CGATTGCTTCAATTGAAGTTTCTCCGATGGCGCAAGT7AGCAGAATCTGCAATGGTGTGC 

61 — , 1 1 1 1 f + 120 

GCTAACGAAGTTAACTTCAAAGAGGCTACCGCGTTCAATCGTCTTAGACGTTACCACACG 

HetAlaGlnValSerApgl UCysAsnGlyValGln - 

AGAACCCATCTCTTATCTCCAATCTCTCGAAATCCA6TCAACGCAAATCTCCCTTATCGG 



121 



-+ 180 

TCTTGGGTAGAGAATAGAGGTTAGAGAGCTrTAGGTCAGTTGCGTTTAGAGGGAATAGCC 
ci AsnProSerLeuIleSepAsnLeuSerLysSepSerGlnArQLysSerProLeuSerVal - 

TTTCTCTGAAGACGCAGCAGCATCCACGAGCTTATCCGATTTCGTCGTCGTGGGGATTGA 

181 , l 1 i ' * 240 

AAAGAGACTTCTGCGTCGTCGTAGGTGCTCGAATAGGCTAAAGCAGCAGCACCCCTAACT 

ci SepLeuLysThpGtnGlnHlsProArQAlaTypPpoIleSepSepSepTppGlyLeuLys - 

AGAAGAGTGGGATGACGTTAATTGGCTCTGAGCTTCGTCCTCTTAAGGTCATGTCTTCTG 



241 



TCTTCTCACCCTACTGCAATTAACCGAGACTCGAAGCAGGAGAATTCCAGTACAGAAGAC 
LysSerGlyHe^ThpLeuIleGlyScpGluLeuAroPpoLeuLysyolHeiScpSepVal 



ci 



S 

P 
h 

I 

TTTCCACGGCGTGCATGC 

301 — - 1 

MAGGTGCCGCACGTACG 

SerThpAlaCysHc-t 
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